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Preface 


I started this text some time ago, just after my PhD, when I began teaching abstract 
algebra to university undergraduate mathematics majors. It was a way to organize 
my lectures and not have to rethink my presentation from scratch each time I taught 
the course (obviously I am not unique in this idea). However, later I used the text 
as a way to record interesting topics, examples, techniques and applications and to 
answer those questions that were often referred to, but often without providing any 
mathematical proof. This text became my algebraic memoirs and a compendium of 
basic notions of abstract algebra I collected during my journeys through this field. 
This journey still continues even as I write this preface. For instance, applications in 
neural networks that are group equivariant and reinforcement learning and a partially 
observable Markov decision process (POMDP) which utilizes group representation. 
Abstract algebra is finding its way into a myriad of important real world (whatever 
that means) applications. 

I remember my first encounter with abstract algebra as an undergraduate math- 
ematics major, a standard course in the major. It was this mathematics course that 
I instantly fell in love with, and knew even then that this would be the area of study 
I would pursue as a graduate student in pure (a very pretentious adjective) mathe- 
matics. I even remember that day in class when I was overwhelmed with amazement 
by the abstraction this topic offered. We were using Fraleigh’s text, A First Course 
in Abstract Algebra, and the topic was discovering when group cosets formed a well- 
defined group in and of themselves, and this led to the notion of normal subgroups. I 
had a terrible instructor for that course and wound up reading the text on my own, 
and probably learned more than I ever would have even with a good teacher. 

This text spans more material than a standard one-year course in abstract algebra. 
However, the text contains as a subset the standard material for a one-year class. It 
is self-contained, in the sense that it has all the necessary background for the topics 
it presents. It is highly recommended that the student has had a course on how to do 
mathematical proof, however I would be hypocritical if I insisted on this, not having 
taking the course myself. Some very basic topics are presented in the first chapter 
that are foundational for what is to follow and is highly recommended, although most 
can be found in that introduction to mathematical proof course I mentioned. Even if 
you have seen this material, it would not hurt to cover it twice. The text is divided 
into two parts — groups (which typically covers one semester), then rings and fields 
for the second semester. There are some who argue that the two parts should be 
reversed, however I decided to start with groups, since there are less axioms involved 
for this algebraic structure and perhaps this makes it more manageable as a first 
topic in abstract algebra. 


xii M@ Preface 


Here are my suggestions for a first pass through the text, although, of course, the 
instructor of this course would know best what to cover. As already stated, Chapter 
1 is a good idea. Most of Chapter 2 is also fundamental, except perhaps Section 
2.9.1 on semi-direct products. Chapter 3 can be skipped without any disruption in 
the continuity and reflects that aspect of the text which is more of a compendium 
of information. I see Chapter 4 as critical which hopefully the instructor can reach 
by the end of the semester. It is a wonderful application of group theory. Section 
4.7 can be omitted. Chapter 5 and 6 can also be skipped on a first pass, although 
the information therein is important. So far as rings and fields are concerned all 
of Chapter 7 is essential, as well as Chapter 8 which is typically covered in a first 
course in abstract algebra. In Chapter 9, one can skip 9.3 (although an interesting 
application if you are interested in such things). Section 9.5 is more theoretical and 
may be skipped. That should finish one year of abstract algebra, but for those who 
want to delve in Galois theory, say in a follow on course, Chapter 10 makes for such 
a course (perhaps with a review of Chapter 9). 

Thank you for considering this text for your adventures in abstract algebra. I 
hope you discover the beauty in this topic just as I did some time ago. 


I 


Groups 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 


CHAPTER I 


Background Material 


N THIS CHAPTER, we present a collection of topics which serve as a background for 
| the ideas you will see in this text. In Section 1.1, we present equivalence relations 
which foreshadow many important structures in abstract algebra such as the cosets 
of a group, the orbits of a group action, and much more. In Section 1.2, we present 
basic ideas about functions since all the structures presented in this text will have 
special functions associated with them. Section 1.3 will highlight some fundamental 
properties of the integers as a prelude to the more general topic of rings. In Section 1.4, 
we describe in more detail congruence modulo n which is an equivalence relation with 
additional properties. The equivalence classes of this relation produce one of the basic 
examples of a group and of a ring. Although, not explicitly presented, ideas in basic 
set theory are prevalent in this text and are an important background topic. 


1.1 EQUIVALENCE RELATIONS 


The notion of a relation on a set is important in many fields of mathematics. We 
shall see many applications of a particular type of relation (called an equivalence 
relation) in this text. We start by defining a relation and then narrow things down 
to an equivalence relation. 


Definition 1.1 A relation ~ ona set A is simply any subset of the cartesian product 
Ax A, i.e. a collection of ordered pairs of elements of A. 


If (a,b) is an element of ~, we instead write a ~ b and we say a relates to b. 


Example 1.1 Here, we list a number of examples of relations. 


1. Let A = {a,b,c,d} and let ~ be the subset {(a, b), (b, 6), (c,d)} of A x A. For 


instance, according to our definition of ~, we have c~d or c relates to d. 


2. Let A= Z (the set of integers) and ~ be <. In other words, (n,m) is in ~ or 
n~m exactly when n <m. 


3. Set A= P(Z) which represents all the subsets of Z (called the power set of 
Z). Let ~ be C, i.e. subset. In other words, a subset X relates to another subset 
Y of Z exactly when X CY. 
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4. Take any set A and let ~ be equality, i.e. a ~ b exactly when a = b. In other 
words, ~ is the set {(a,a) : a€ A}. 


5. Let f : A— B be a function from a set A to another set B. Define a relation 
on Aas follows: a~b iff f(a) = f(b). 


6. Let A= Z and define ~ as follows: n~m iff there exists an integer k such 
that m = nk. One says that n divides m and we write n|m. For instance, 
(3,15) is in ~, since 3 divides —15 because —15 = 3(—5). 


7. Define a relation on the set Z as follows: Fix a positive integer n and define 
m~k iff n\(m—k). This relation is called congruence modulo n, and in 
place of m~ k, we typically write m =, k orm =k (mod n). Section 1.4 is 
dedicated to this important relation. 


8. Define a relation on Q as follows: § ~ 5 iff ad = bc. So for instance (, =3) 
is in ~ or 5 ~ =3, since (1)(—6) = (2)(—3). 


There are various properties one may wish to investigate with respect to a relation. 
We list a few below. 


Definition 1.2 Let ~ be a relation on a set A. We say ~ is 
1. reflexive if for alla € A, we have a~a. 
2. symmetric if for alla,b € A, we have a ~ b implies b~ a. 
3. transitive if for all a,b,c € A, we havea~b andb~c implies a~ c. 
4. irreflexive if for alla € A, we havea a. 


5. anti-symmetric if for alla,b € A, we havea ~ b and b~ a implies a = b. 


Some examples of types of relations that are of particular importance in mathematics 
are the following: 


Definition 1.3 Let ~ be a relation on a set A. We say that ~ is 
1. a partial ordering of A if it is reflexive, anti-symmetric and transitive. 
2. an equivalence relation on A if it is reflexive, symmetric and transitive. 


3. a function on A if whenevera~b andaw~c, then it must be that b = c. 


Example 1.2 We see now that C is a partial ordering on P(Z) and =p, is an equiv- 
alence relation on Z. Note that if we restrict the relation divides to a relation on 
positive integers, then it becomes a partial ordering on positive integers. 


The focus of our discussion for the remainder of this section is equivalence rela- 
tions. Let’s list here the examples introduced already which are equivalence relations. 
The reader should take the time to prove that they are indeed equivalence relations. 
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Example 1.3 Here are some examples of equivalence relations. 
1. Take any set A and let ~ be equality, i.e. a~ b exactly when a = b. 


2. Let f: A B be a function from a set A to another set B. Define a relation 
on Aas follows: a~b iff f(a) = f(b). 


&. Define a relation on the set Z as follows: Fix a positive integer n and define 


m~k iff m=k (mod n). 


4. Define a relation on Q as follows: § ~ 4 iff ad = be. 


Example 1.4 Here are some equivalence relations specific to linear algebra for those 
familiar with the material. 


1. Matrix equivalence is an equivalence relation on the set of m x n matrices; i.e. 
for twom xn matrices A and B define A~ B iff there exist a finite number 
of elementary row operations which convert A into B. 


2. Matrix similarity is an equivalence relation n x n matrices; t.e. for twon x n 
matrices A and B define A~ B iff there is an invertible matrix P such that 
BP AAP, 


8. Isomorphism is an equivalence relation on the set of vector spaces; i.e. two 
vector spaces V and W relate iff they are isomorphic. For instance, R® is 
equivalent to P:, since 3-tuples are isomorphic to polynomials of degree 2 or 
less (because they have the same dimension, namely 3). 


As we have stated already, for the remainder of this section, we will be assuming 
that ~ is an equivalence relation, and as such, we typically use the notation = in 
place of ~. 


Definition 1.4 Let = be an equivalence relation on a set A anda € A. The equiv- 
alence class of a with respect to =, written 


[alJ=] equals {bE A : a=D}}. 


The element a is sometimes called a representative of the class |a|=. The collection 
of all equivalence classes of A with respect to =, in other words {|aj= : a € A}, is 
denoted by A/ = and is called the quotient set of A. 


At times we will simply write [a] in place of [a]= when the equivalence relation 
is understood, and we may simply call |a] the class of a for brevity. Some other 
notation for an equivalence class which the reader may encounter here or in other 
texts is @ in place of [a]. 


Example 1.5 Let’s compute some equivalence classes for the examples already pre- 
sented. 
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1. The equivalence classes for equality on a set A are singleton sets, i.e. [a] = {a}, 
since no other element besides a relates to a. 


2. For the equivalence relation we defined on Q (see Example 1.8.4), an equivalence 
class represents all the different ways we can represent a particular fraction. For 
instance, the equivalence class 


1). f1. =-3 12 
2] \2’-6’ 24"°"s° 
8. Consider the equivalence relation congruence modulo 3 on Z. There are exactly 


three distinct equivalence classes. Each class contains integers which when di- 
vided by 3 yield the same remainder. 


[0] = {0, +3, +6,...} 


Fl Ae ca tA a 
A ar 18 React 


One can view equivalence relations as a generalization of equality. Each class in a 
sense contains all the elements of a set which we view as being the same. Just consider 
the example of the equivalence class of 5. In practice, we view 5 and = as being 
the same even though symbolically they look very different. Equivalence classes are 
simply a formal way of equating things which we wish to view as being equal. 

We now prove a result which uncovers the essential properties of an equivalence 
relation. 


Lemma 1.1 Let = be an equivalence relation on a set A. 
1. For alla € A, we have a € [a]. 
2. For alla,b € A, we have [a] = [b) iff a=b. 
3. For all a,b € A, either [a] = [b| or [a] N [b] = 9. 


Proof 1.1 The first part follows immediately from the reflexive property. For the 
second part, assume first that [a] = [b]. Now since a € [a], we have a € [b] and so 
by definition and symmetry a = b. Now assume that a = b. Using transitivity and 
symmetry, notice that c € [a] iff a=c iff c=b iff cé [b] and so |{a] = [b). For the 
last part, either [a] = [b] or |a] ¥ [b). In the latter case, we show that [a] and [b) are 
disjoint, thus proving the result. Indeed, we prove this by proving the contrapositive. 
Therefore, assuming [a] [b] £0, there is some c € [a] MN [b]. Then c € [a] and c€ |b 
and soc =a andc= b. Using symmetry and transitivity, we have a = b and so by 
the second part [a] = [b]. 


Notice that the second part of the lemma says that any element of a class can 
represent that class, i.e. if b € [a], then [b] = [a]. The first and third part of the lmma 
says that equivalence classes divide the set A into a union of disjoint subsets. 
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Figure 1.1 A partition is a puzzle. 


Example 1.6 Let’s consider what equivalence classes look like in the case of our 
linear algebraic examples of an equivalence relation. 


1. Consider the equivalence relation on Mn matrix equivalence. Each equivalence 
class can be represented by a reduced row-echelon form matrix. 


2. Consider the equivalence relation similarity on n x n diagonalizable matrices. 
Each equivalence class can be represented by a diagonal matrix. 


&. Consider the equivalence relation isomorphism on finite dimensional vector 
spaces. Each equivalence class can be represented by R”, i.e. if V represents 
the collection of all finite dimensional vector spaces, then 


V/= equals {[R"] | n=0,1,2,...}, 
if we allow R° to be the set {0}. 
Let’s formally define this notion of a partition of a set. 


Definition 1.5 Let A be a non-empty set and P be a family of non-empty subsets 
of A. We say P is a partition of A or P partitions A if 


1. For alla € A, there is an X € P such thatae X. 


2. For all X,Y € P distinct, we have XNY =9. 


One can think of a partition of a set as a puzzle (see Figure 1.1) where each puzzle 
piece is an element of the partition, and when you put all the puzzle pieces together, 
you get set A. According to this formal definition, we see from the lemma that A/ = 
is a partition of A. 


Example 1.7 Consider the earlier example of congruence modulo 8 an equivalence 
relation on Z. The partition into equivalence classes, namely Z/ =3, consists of three 
puzzle pieces, namely [0], [1] and [2]. These three classes are pairwise disjoint and 
their union is all of Z. 


8 m Fundamentals of Abstract Algebra 


EXERCISES 

1 For the examples in Example 1.1, list three elements in each relation. 

2 State and prove which of the properties in Definition 1.2 has each of the exam- 
ples in Example 1.1. 

3 Let A be the set of integers and ~ be the relation <. State and prove which of 
the properties in Definition 1.2 this relation has. 

4 Verify that C is a partial ordering on P(Z) (you may use your work in Exer- 
cise 2). 

5 Verify that =, is an equivalence relation on Z (you may use your work in 
Exercise 2). 

6 Verify that if we restrict the relation divides to a relation on positive integers, 
then it becomes a partial ordering on positive integers. 

7 Verify that the relations listed in Example 1.3 are equivalence relations (you 
may use your work in Exercise 2). 

8 If f : R > R is the function defined by f(x) = 27, describe the equivalence 
classes of Example 1.3.2. 

9 For each of the following relations on Z, decide if it’s an equivalence relation. If 


10 


it is, then verify; otherwise, give a counter example. 


a. awd iff ja—b] <3. 
b. a~wb iff 2\(a+ 0). 
c. a~wb iff 3\(a4+ 0). 
d. a~wb iff ab<0. 


e. awd iff ab> 0. 


For each of the following relations on a set A, decide if it’s an equivalence 
relation. If it is, then verify; otherwise, give a counter example. 


a. A is the set of all English words. For x2,y € A define x ~ y iff x has a letter 
in common with y. 


b. A= R? and for (a,b), (c,d) € A define (a,b) ~ (c,d) iffa+d=b+c. 


c. Let A= {2,3,4,...}. For m,n € A define m ~ n iff there is a prime p such 
that p|m and p | n. 
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11 Define the following relation on R?: (a,b) ~ (c,d) iff a+b =c+d?. 


a. Verify that ~ is an equivalence relation. 


b. Describe geometrically an equivalence class for ~. 


12 For each of the following relations, verify that it is an equivalence relation and 
then describe clearly a typical equivalence class. 


a. A is the set of all differentiable functions in one variable and f = g iff 
f(x) = g(a). 
b. A=R? and (a,b) = (c,d) iffa=c. 


13 Suppose a relation ~ on a set A has the following two properties: 


a. Forallac A, a~a. 


b. For all a,b,c€ Aifa~band bw~c, then c~a. 


Prove that ~ is an equivalence relation on A. 


14 Consider a circle divided into four equal sectors. 


a. If we can color each of the four sectors either black or white, how many 
different colorings are there? 


b. Let A be the set of colorings in part a, and define a relation as follows: Two 
colorings of the circle are equivalent if you can get from one to the other by 
rotating the circle by either 0°, 90°, 180° or 270°. Prove that this relation is, 
in fact, an equivalence relation. 


c. List the set of equivalence classes for this equivalence relation. 
15 Prove the following statements: 


e Given a partition P of a set A, the relation defined by a = b iff there is an 
X €P such that a,b € X defines an equivalence relation whose equivalence 
classes consist precisely of the elements of P. 


e Conversely, if one starts with an equivalence relation = on a set A and 
forms the partition into equivalence classes, and then defines an equivalence 
relation as we just did in the previous bullet, then we wind up with the same 
equivalence relation as we began with. 


1.2 FUNCTIONS 


In this section, we present the facts about functions that we use from time to time 
in the course of this text. We start with the definition. 


Definition 1.6 Let A and B be sets. 
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1. The cartesian product of A and B, written 


Ax B={(a,b) : a€A and be B}. 


2. A relation R from A to B is any subset of A x B. 


3. A function (or map) f from A to B, written f : A > B, is a relation from 
A to B with the added properties that for each a € A, 
e there is some b € B such that (a,b) € f, and 
e if (a,b), (a,c) € f, thenb=c. 
4. Given a function f from A to B, set A is called the domain of f and the set 


B is called the codomain of f. The range (or image) of f, written f(A) = 
{bE B: (a,b) € fh}. 


5. Given a function f from A to B, if (a,b) € f, then element a € A is called an 
input for f and the element b € B is called an output for f. We employ the 
notation f(a) = b in this situation. 


6. Two functions f and g from A to B are equal if they are equal as sets. In 
other words, for alla € A, we have f(a) = g(a). 


Example 1.8 Here, we present some specific examples of functions. 


1. Let A = {a,b,c}, B = {1,2,3} and define f : A > B by f = 
{a3 tbs T) fe 1). Tren fa) = 3. f(b) = 1, (6) = and Ry 3h 


2. Functions can also be defined in terms of a formula. For instance, we can define 
f: ZZ by f(x) =2?. In which case 


Ff —={(0,0), (8,0, 24) Hee 


We will move away from the ordered pair representation of a function, i.e. (a,b) € 
f, and make use of the notation f(a) = b from now on. 


Definition 1.7 Let f : A— B be a function. 


1. f is injective (or one-to-one) if f(a1) = f(a2) implies a, = ag. In other 
words, in terms of the contrapositive statement, two different inputs cannot 
yield the same output. 


2. f is surjective (or maps onto B) if for every b € B, there is ana € A such 
that f(a) = b. In other words, the image and codomain are equal. 


&. f is a bijection if it is both injective and surjective. 


Example 1.9 These examples help illustrate the definitions just related and also 
demonstrate that there is no dependency between the two concepts of injective and 
surjective, t.e. neither property implies the other. 


Background Material m@ 11 


Let f:Z— Z by f(n) = 2n. This function is one-to-one but does not map 
onto Z. 


Let f :Z— {0,1} by 


if n is even 


0, 
f(r) = 1, ifn is odd 


This function is not one-to-one and maps onto {0,1}. 


. Let ff: ZZ by 


_ J n/2, if nis even 
f(n) n, if n is odd 


This function is not one-to-one and maps onto Z. 


. Let f: ROR by f(x) = 2x — 3. This function is a bijection. 


. Let f : {0,1} > {0,1} by f(0) = 0 and f(1) = 0. This function is neither 


one-to-one nor maps onto {0,1} 


Example 1.10 We introduce some important examples of functions which we shall 


see in this text. 


1. 


For any set A, the identity map on A, written 14 : A > A is defined by 
la(a) =a for alla € A. Certainly this map is a bijection. 


If A is a proper subset of a set B, the inclusion map writteni: A — B is 
defined by i(a) = a for alla € A. This map is again one-to-one but certainly 
cannot map onto B. 


For any sets A and B, the projection map onto A, writtenat,: Ax BoA 
is defined by wa(a,b) = a. This map maps onto A but cannot be one-to-one if 
|B| >1. 


. Consider any function f : A > B. The restriction map of f to C C A, 


written f | C:C — B is defined by (f | C)(c) = f(o) for allc eC. 


. Consider any function f : A— B. Anextension map of f to C 5 A, written 


fC — B is any map from C to B having the property that f(a) = f(a) for 
allace A. 


A simple example of this is the following extension of 1z to Q defined by 


es ~ {m m 
1z:Q->Z by lz (=) = mn, assuming that — is in lowest terms. 
n n 


. Let = be an equivalence relation on a set A. The quotient map (or canonical 


map) v: A > A/ = is defined by v(a) = [a]. This map certainly maps onto 
A/ =. 
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7. Let A be any set. A function a : A > A is a permutation of A if o is a 
bijection. The collection of all permutations of A is denoted by Sym(A) and 
is called the symmetric group on A. When A = {1,2,...,n}, then Sym(A) 
is typically denoted by S, and is called the symmetric group on n. 


Lemma 1.2 Let f : A— B be a function. 


1. fis a bijection iff for every b © B there exists a unique a € A such that 
f(a) = b. 


2. If A and B are finite sets of the same size, then f is one-to-one iff f maps 
onto B. 


Proof 1.2 We leave the first part as a simple exercise. To prove the second part, 
Ser A = {ais 4.) @,} ond B= {iwnrs bah. Then fA) — ff Giinens J (Gn) fb 
B. First assume that f is one-to-one. Then f(a1),...,f(G@n) are all distinct. Thus, 
|f(A)| =n = |B] and so f(A) = B proving f maps onto B. We prove the converse 
by contrapositive. If f is not one-to-one, then f(ay),...,f(an) are not all distinct. 
Thus, |f(A)| <n = |B| and so f(A) is a proper subset of B proving f does not map 
onto B. 


Definition 1.8 Let f: A> Bandg: BC. The composition function, written 
go f = {(a,c) : 46 © B such that (a,b) € f and (b,c) € g}. In other words, 
(go f)(a) = g(f(a)) (note that the domain of g is B so that g(f(a)) always makes 
sense). 


Definition 1.9 Let f : A— B be a function. The function g: B > A is an inverse 
of f ifgof=1,4 and fog= 18, te. forallac A andbe B, 


g(f(a)) =a and f(g(b)) = b. 


Example 1.11 Check for f,g : R > R defined by f(x) = 2x — 3 and g(x) = ot8 
that g is an inverse of f. 


In a sense, the inverse g of a function f undoes what the function f does. Below 
are some pertinent results concerning functions and their inverses: 


Theorem 1.1 Let f : A —> B be a function from a set A to a set B. 
1. f has an inverse iff f is one-to-one and maps onto B, t.e. f is a bijection. 
2. If f has an inverse, then it has exactly one. 


&. If f has an inverse, then the inverse is also one-to-one and maps onto B, i.e. 
the inverse is a bijection. 


4. If fy has inverse g; and fz has inverse go, then fio fo has an inverse, namely 
92° 9i- 
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Proof 1.3 To prove the first statement, first we assume that f has an inverse g. We 
show f is one-to-one. For ay, a2 € A, if f(ai) = f(a2), then g(f(a1)) = g(f(aa)), we. 
(9° f)(a1) = (go f)(a2). By definition of inverse, this equation reduces to a, = ao, 
and we have proved that f is one-to-one. To show that f maps onto B, take any 
be B. We have to find ana € A such that f(a) = b. The element a = g(b) does the 
trick, since f(g(b)) = (f © g)(b) = b. 

Now assume that f is a bijection. Define a function g : B —> A as follows: 
For b € B, by Lemma 1.2, there is a unique a € A such that f(a) = b. We then 
define g(b) = a. Note that g is indeed a function, since the element a € A is uniquely 
determined. We now prove that this g is the inverse of f. First, for any b © B, we 
have (f 0 g)(b) = f(g(b)) = f(a) = b. Second, take anya € A. Set b = f(a). Note 
that by definition of g, we have that g(b) = a. Hence, (go f)(a) = g(f(a)) = g(b) =a. 

To prove the second statement, suppose that g, and gz are inverses of f. We will 
show that gi = g2 and so f has only one inverse (when it exists). For any b € B, 
since f is a bijection, by Lemma 1.2, there is a unique a € A such that f(a) = b. 
Then 


g1(b) = (F(a) = (91 © f)(@) = @ = (92 0 f)(@) = gal F(@)) = g2(0). 


We leave the remaining proofs as exercises. 


Because of the fact that when an inverse exists there is only one, we can assign 
it notation without any confusion. The inverse of f will be denoted by f~!. Take 
note that this is simply notation and should not be taken literally as 1/f. There is, 
however, a use of the notation f~! which does not assume that the inverse of f exists. 


Definition 1.10 Let f: A— B be a function and C C B. The inverse image (or 
preimage) of C under f, written 


FC) ={aer ¢ fla) ec}. 


Example 1.12 Consider Example 1.9.2. The inverse image of 1 under f, i.e. 
f-'({1}) equals the set of odd numbers. 


The last topic of this section deals with the notion of a well-defined map. This 
topic arises when an input can be represented in more than one way. In particular, we 
shall look at the situation when inputs are equivalence classes. Basically we want to 
check when such a map is indeed a function, i.e. If a = b (two representations of the 
same input), then f(a) = f(b). For otherwise, we would have a single input being sent 
to two different outputs contradicting the definition of a function. In our particular 
situation, we might have an equivalence relation = on a set A and f : A/ => B. 
We want to make sure f is well-defined, i.e. that f is by definition a function. This 
entails checking that whenever [a;] = [a2] we have f([ai]) = f({a2]). This property 
is essential, for instance, when we define our group operation for cosets of a group 
modulo a normal subgroup. In Section 1.4, we will already see an important instance 
of well-definition. 
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Example 1.13 We will illustrate this concept and its verification with several exam- 
ples. 


1. Consider the equivalence relation congruence modulo 3 on Z. Define f : 
Z/ =3—> Z by f({n]) = n. This map is not a well-defined function, since 
for instance [0] = [3] while f({0]) =0 43 = f([3}). 


2. Consider the equivalence relation congruence modulo 2 and define f : Z/ =2—> 
Z by 
0, ifn is even 


PAI Vs: 1, ifn is odd 


We claim f is a well-defined function. If |n| = |m], then n =2 m and so 
2\(n —_m). Therefore, n —m is even and so either n and m are both even or 


both odd. In either case f({n|) = f([m]). 


3. Consider the equivalence relation = we defined earlier on Q (Example 1.8.4) 
and consider 


f:Q/ = Q defined by f({a/b]) = a2/b?. 


We show that f is a well-defined function. If [a/b] = |[c/d], then a/b = c/d and 
so ad = bc. Using properties of Z we have (ad)? = (bc)? and so a?d? = b?c?. 
Then a?/b? = c*/d? which implies f(a/b) = f(c/d). 


EXERCISES 


1 For each function in Example 1.9, verify the statements made for injective and 
surjective. 


2 For each of the following functions, decide if it is injective, surjective or a bijec- 


tion: 

a. f:R°°-> R*® by f(z) = 4. 

b. f :IR* > R* by f(z) = 4, where R* = R — {0} 
c. f:R—R by f(x) =sin (2?) 

d. f:Z2xZ—Zby f(m,n) = 3m+n. 

e. f:Z2xZ—>Zby f(m,n) = 4m + 2n. 

f. f:Z2xZ>Zx Zby f(m,n) = (2m,m+n). 


g. f:RxR-ORxR by f(z, y) = (22,r4+ y). 


3 Carefully explain why the example given in Example 1.10.5 is indeed an exten- 
sion. 


4 Let f: A> B,g:B4Candh:C-— D. Prove the following statements: 


a. Composition is associative, i.e. (hog)o f=ho(go f). 


10 
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f,g one-to-one implies go f one-to-one. 

If f maps onto B and g maps onto C, then go f maps onto C. 
g © f one-to-one implies f one-to-one. 

go f maps onto C' implies g maps onto C’. 


Give a counterexample to the following statement: go f one-to-one implies 
g one-to-one. 


Give a counterexample to the following statement: go f maps onto C' implies 
f maps onto B. 


Let A = {a,b,c} and B = {z, y, z}. 


a. 
b. 
c. 


d. 


How many possible functions f : A + B are there? 
How many possible surjective functions f : A + B are there? 
How many possible injective functions f : A > B are there? 


How many possible bijective functions f : A > B are there? 


Consider the functions f: A> Bandg: BOC. 


a. 


b. 


Prove that if f and g are bijections, then go f has an inverse and (go f)"! = 
Gog 


Give a concrete example where go f is a bijection, but f and g are not. 


Prove Lemma 1.2.1 


Prove the remaining parts of Lemma 1.1. 


Compute f~'({1}) for Example 1.9.3. 


Decide whether or not each of the following functions is well-defined: 


a. 
b. 


C; 


:Q— Q by f (F) = (FR). 
:Z/ =4— Z/ =s¢ by f( 


3 


f:Z/ Sn Z/ Sn by f (lal) = [2a]. 

f:Z/=n-7 Z/ =n by f({a]) = [ma + b], for some fixed integers m and b. 
f:Z/ =n Z/ =n by f (lal) = [2”] 

f:Z/ =n Z/ =n by f([a]) = [a*], for some fixed positive integer k. 

f : Z/ 27 Z/ =4 by f(lal2) = [ala 

f:Q—>Zby f(8)=m. 

f:Z/ => Z/ =6 by f([z]4) = [B}6 

i 

f 


8 
ey 
II 
8, 

for) 
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j. ff: Z/ Ss Z/ = by f((mla) = [m*]3. 
k. g: QQ by g([8]) = [22]. 


11 Consider f : Z/ =35- Z/ =5 xZ/ =7 by f(([z]35) = ([z]s, [z]7). 


a. Prove f is a well-defined function. 
b. Is f is injective? (justify) 

c. Is f is surjective? (justify) 

d. Is f is bijective? (justify) 


e. Does f have an inverse? (justify) 


12 Prove that f is one-to-one iff for any set C' and all functions h: C > A and 
k:C— Awe have that foh = fok implies h=k. 


13 Consider a function f : Z/ =,— Z/ =» defined by f([z]n) = [maz], where n,k 
and m are positive integers. Show that f is well-defined iff k|mn. 


1.3 BASIC NUMBER THEORY 


Some basic concepts in abstract algebra make use of number theory implicitly. In this 
section, we collect together concepts and results which we will need in later portions 
of the text. These concepts include the Division Algorithm, the greatest common 
divisor and results about prime numbers. 

We have already been introduced to the notion of one integer dividing another and 
this relation on integers is both reflexive and transitive. Furthermore, if we restrict 
ourselves to positive integers, then divides is also anti-symmetric. Here are two further 
properties of divides. 


Lemma 1.3 Let m,n,d € Z. 


1. If m|n and n|m, then m= tn. 
2. If d\m and d\n, then d\(am + bn) for all a,b € Z. 


Proof 1.4 We prove the first part and leave the second as an exercise. Since m|n 
and njm we have n = mk and m = nl and so m = mkl which can be rewritten as 
m(1 — kl) = 0. Hence, either m = 0 (and so n = 0k = 0 proving the result) or 
1—kl = 0. In the latter case, kl = 1 which as integers implies either k,l = 1 or 
k,l =—1 and som=nl =n. 


We now prove the Division or Euclidean Algorithm for integers. We leave as an 
exercise the following result which will be used in the theorem: For all n € Z we 
have n+ |n| > 0. 
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Theorem 1.2 (Division/Euclidean Algorithm) Let n,d € Z with d > 0. There 
exist unique q,r € Z having the property thatn = qdt+r withO<r<d. 


Proof 1.5 To show existence of q and r consider the set D={n—ad : x €Z and 
n— ad > 0} which is certainly a subset of N, the natural numbers. Now D is non- 
empty, since for instance n —(—|n|)d =n+|n| > 0 is in D. Because D CN, it must 
have a smallest element, say r, andr € D implies r = n—qd for some q € Z. Solving 
for n we have n = qd+r which is half of what we need to show. We already know 
r > 0 so it remains to show that r < d. Notice that r—d=n—qd—d=n-(q+l)d 
which is of the right form to be in D. However, r—d <r andr is the smallest element 
of D which implies that r —d must fail to be non-negative, i.e. r—d <0 orr <d. 

To show uniqueness of q and r, suppose thatn = qd+randn=dd+r' with 
O0<r,r’ <d. Without loss of generality, assume that q > q'. Suppose, to the contrary, 
that q > qd and so as integers we would have q > qd +1. Then r’ = n—-—dd > 
n—(q—1)d = r+d > d, a contradiction. Hence, q= q and sor = n—qd=n-—dd=r' 
thus proving uniqueness. 


The integer multiples of n € Z will be denoted by nZ. For instance, 


3Z = {0, +3, +6, +9,...}. 


In the next lemma, we will prove some useful facts about nZ which will be used later 
in the text. 


Lemma 1.4 Consider the set nZ for some n EN. 


1. If x,y € nZ, then so arex+y,x—y € nZ. We say that nZ is closed under 
addition and subtraction. 


2. If X is any non-empty subset of the integers closed under addition and subtrac- 
tion (see part 1), then X = nZ for some n EN. 


Proof 1.6 The first part is easy and left as an exercise. For the second part, should 
X = {0}, then X = 0Z and we are done. Otherwise X has positive elements. Indeed, 
if we take any non-zero element m © X. Then0 =m—me€ X by assumption and 
so —m =0-—me X again by assumption. Since m and —m are in X and one of 
these must be positive we can conclude X has positive elements. Now let n be the 
smallest positive element in X. We show that X = nZ. First, take x € nZ so that 
x = nk. Ifk = 0, then x = 0 and as we saw above 0 € X. If k ts positive, then 
mS peer X by assumption (and induction). If k is negative, then as we 


k 
saw above —n € X and x = (—n)+---+(-n) € X by assumption and induction. 
e—{+_——__ SS 
—k 
Hence, nZ © X. For the reverse inclusion, take any x € X and write x = nqg+r 
with0O<r<n using the Division Algorithm. Since nZ C X and X is closed under 
addition and subtraction, it follows that r = « —nq € X. Since n is the smallest 


positive integer in X it must be that r= 0 and so x = nq € nZ. 
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Definition 1.11 Leta and b be two non-zero integers. The integer d is the greatest 
common divisor of a and b, written d = gcd(a, b) if 

is. U 

2. dla and d\b (common divisor) and 


3. ela and e\b implies e\d (greatest). 


Example 1.14 gcd(—120,36) = 12. We note here that the Division or Euclidean 
algorithm also refers to a systematic way for finding the greatest common divisor of 
two integers. Its method relies on Theorem 1.2 of the same name. 


Theorem 1.3 For any non-zero integers a and b the greatest common divisor of a 
and b exists and is unique. 


Proof 1.7 To show existence, consider the set C = {ax+ by : x,y € Z}. Certainly 
C # @ (since for instance a = a-1+b-0 € C). Furthermore, C is closed under 
addition and subtraction, since 


(ax, + by) + (arg + by2) = a(x, + x2) + b(y1 + y2) EC. 


Therefore, by Lemma 1.4, we know that C = dZ for some positive integer d. We 
show now that d = gcd(a,b). We already have d > 0. Since a,b € C = dZ we can 
write a = dk and b = dl for some integers k andl which implies dja and d\b. Finally, 
if ela and e|b, sinced = d-1€ dZ=C we can write d = axo+byo for some Xo, yo € Z. 
Hence, by Lemma 8.1, e|(axo + byo), i.e. eld. 

To show uniqueness, assume d and d' are both greatest common divisors of a and 
b. Since dla and d|b and d' = gcd(a,b) this implies d|d’. Reversing roles yields d'\d 
and so by Lemma 8.1, we have d = +d’. However d,d' > 0 so we can conclude that 
=a 


Corollary 1.1 Let a and b be non-zero integers. 
1. If d= gcd(a,b), then there exist xo, yo € Z such that d = axg + byo. 


2. gcd(a,b) = 1 iff there exist xo, yo € Z such that axo + byo = 1. 


Proof 1.8 The first part follows immediately from the existence part of the proof of 
Theorem 1.8. For the second part, one direction follows immediately from the first 
part of the corollary. Assume that there exist xo, yo € Z such that aro + byo = 1 and 
set d = gcd(a,b). Since dla and db it follows that d\(axo + byo) and so d|1 which 
forces d to equal 1, since d > 0. 


Definition 1.12 A positive integer p is prime if p has exactly two distinct positive 
divisors, i.e. p#1 and ifa>0 and alp, then eithera=1 ora=p. 


Lemma 1.5 An integer p> 1 is prime iff whenever a,b € Z and plab, then either 
pia or pb. 
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Proof 1.9 First assume that p is prime and assume that plab. If pla, then we are 
done. Otherwise p { a. In this case, we will show p|b. Since p { a this implies 
gcd(p,a) = 1 (convince yourself of this). By Corollary 1.1, there exist xo, yo € Z 
such that pro + ayo = 1. Then b = b(pxp + ayo) = p(bx0) + ab(yo). Since plp and 
plab we have p\[p(bxo) + ab(yo)], te. p|b. For the converse, assume whenever a,b € Z 
and p\ab, then either pla or p|b and p is not prime. This implies p = ab for some 
1<a,b<p (explain). Since p|p we have plab and so by assumption pla or p|b which 
is not possible, since 1<a,b <p, thus a contradiction. 


Corollary 1.2 If p is prime and p|(aja2---an), then pla; for somei, 1<i<n. 
Proof 1.10 This follows immediately by induction. 


Next we prove the existence and uniqueness of prime factorization for integers 
greater than 1 called the Fundamental Theorem of Arithmetic. 


Theorem 1.4 (Fundamental Theorem of Arithmetic) Any integer a > 1 can 
be factored uniquely as a product of primes. More precisely, there exists unique primes 
Pi < po <++*pn and positive integers e1,e2,...,€n such that a= phi ps? +--+ per. 


Proof 1.11 First, we prove existence of a prime factorization. Suppose to the con- 
trary that there were integers greater than 1 which had no prime factorization. Set P 
equal to the set of all integers greater than 1 which do not have prime factorizations 
(which we are assuming to be non-empty). Let m be the smallest such element of P. 
Certainly m is not prime (else it has a trivial prime factorization), so write m = ab 
with 1 <a,b<m. Since m is smallest in P we know a and b are not in P and so 
have prime factorizations. But then multiplying these two factorizations will yield a 
prime factorization for m = ab, a contradiction. 

To prove uniqueness, again we suppose to the contrary that there are integers 
greater than 1 which have more than one prime factorization and set U equal to the 
set of all such integers. Since U is non-empty, let m be the smallest element of U. Say 
m = pip ++ ps and m = qf! qs? ---qf* where pr < po < +++ Py and qi <q <-*- 4 
are primes and €1,€2,.--,€n and fi, fo,..., fe are positive integers. Equating these 

k 


two factorizations we see that p, divides gh af? tee gi and by Corollary 1.2, pilq; for 
some i, 1<i< k. But then " = q (explain) and so by cancellation we have 


a Lye? & Sy — gf. : ee _ Set r= a tf? .. . pen = a qi. ge? Ss , qi. 
Since : <m Ue know hit r has a oe prime factorization, and so pe 18? .. . pen 
and qf} . qi” : . gi must be identical factorizations. If we throw py = q back in 
to get the A prime factorizations of m, then p{' ps? --- po and qi" q5 re - git must be 


the same as well, which is a contradiction tom € P. 


EXERCISES 


1 Define a relation ~ on Z*! (integers greater than 1) as follows: n ~ m iff 
there exists a prime p such that p|m and p|n. Decide whether or not it is an 
equivalence relation. If it’s not, then provide a counterexample. If it is, verify 
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the axioms of an equivalence relation and then describe what the equivalence 
classes represent. 


2 Prove Lemma 8.1.2. 


3 Decide whether each of the following statements are true. If so, give a proof; 
otherwise, give a counter-example. 


a. If a,b,c € Z and ald, then al(bc). 
b. Ifa,b€ Z and a|(b— 1), then a|(b? — 1). 
c. If alc and b\c and gcd(a, b) = 1, then (ab)|c. 


4 Prove (by cases using the definition of absolute value) that for all n € Z we 
have n+ |n| > 0. 


5 Prove Lemma 1.4.1. 

6 For an integer a, prove if p is prime and pla, then gcd(p, a) = p. 

7 For an integer a, prove if p is prime and p{ a, then gcd(p,a) = 1. 

8 For an integers a,b,c, prove if alc, b|c and gcd(a, b) = 1, then (ab)|c. 

9 For an integers a,b,c, prove gcd(a,bc) = 1 iff gcd(a,b) = 1 and gcd(a,c) = 1. 
10 For an integers a,b,c, prove if cjab and gcd(a,c) = 1, then c|b. 


11 Prove that if d = gcd(n,m), then we can express n = dx and m = dy for some 
integers x and y with gcd(x,y) = 1. 


12 Prove by induction Corollary 1.2. 


1.4 MODULO ARITHMETIC 


Our focus in this section is the equivalence relation congruence modulo n on Z. 
First, we prove some basic properties of congruence modulo n. This first property 
of congruence modulo n is, in fact, characteristic of a more general concept called a 
congruence relation. 


Lemma 1.6 Leta,b,c,d € Z. Ifa =, c and b =, d, then a+b =, c+d and ab =,, cd. 


Proof 1.12 Since a=, c andb=, d we have n|(a—c) and n|(b—d). Then n|[(a—c)+ 
(b—d)] or equivalently n||(a+6) —(c+d)| which implies a+b =, c+d. Furthermore, 
n|[(a — c)b + (b — d)c] or equivalently n|(ab — cd) which implies ab =, cd. 


Lemma 1.7 Let n> 1 and a be integers. There exists b € Z such that ab=, 1 iff 
gcd(a,n) = 1. 
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Proof 1.13 Note that ab=, 1 iff n\(ab—1) iff ab—1=nk for somek eZ iff 
ab + n(—k) = 1 for some k € Z which implies gcd(a,n) = 1, by Corollary 1.1.2. 

Assuming gcd(a,n) = 1, by Corollary 1.1.2, there exist xo,yo € Z such that 
axo + nyo = 1. Then as we can see from the previous direction, the b we are looking 
for such that ab =, 1 is x0. 


Now let’s focus on the collection of equivalence classes Z/ =,. The first questions 
that need to be addressed is how many distinct classes are there and is there a nice 
way to represent them? The next result answers these questions. 


Lemma 1.8 For any positive integer n we have that Z/ =,= {[0], [1],...,[n — 1}. 


Proof 1.14 First note that any class is equal to one of [0], [1],...,[n — 1], since if 
m€Z, using the Division Algorithm, we can write m = qn+r for integers q and r 
and0<r<n. Now since m—r = qn this implies n|(m—r) and som =, r which 
implies that [m| = [r] whereO <r <n-—1. Finally, note that the [0}, [1],...,[n — 1] 
are all distinct, since if [|r] = [s] forO < r,s <n-—1, thenr =, s and so n\(r — s) 
which implies r— s = nk for some k € Z. But since —n <r—s <n, the only way 
this could be possible is ifr—s=0 orr=s. 


We would now like to take these classes of Z/ =, and define an addition and 
multiplication for them. The most natural way to proceed would be to define class 
addition and multiplication as addition and multiplication of representatives, i.e. 


[a] + [6] = [a+ 9] and [a] - [b] = [a- 8). 


The problem is that we have to be sure these two binary operations which are 
functions from (Z/ =,) x (Z/ =n) to Z/ =» are well-defined. As it turns out there is 
nothing to fear as is proved below. 


Lemma 1.9 The operations of addition and multiplication in Z/ =, defined by |a]+ 
[b] = [a + | and [a] - [b] = [a- b] are well-defined. 


Proof 1.15 What we must show is that if [a] = [c] and [b] = |d], then [a] + [b] = 
[c] + [d] and [a][b] = [c][d]. But this follows almost immediately from Lemma 1.6. 


Having the peace of mind that these two operations are well-defined, we now 
list some properties these operations enjoy. We will later call these properties the 
properties of a commutative ring with unity. We leave the details as a simple 
exercise which relies heavily on properties of the integers. 


Lemma 1.10 Jf a,b,c € Z, then 
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We point out that not every class [a] € Z/ =, has a class [6] such that [a][b] = [1]. 
In fact, we see from Lemma 1.7 that such a class [b] exists iff gcd(a,n) = 1 (exercise). 
A class [a] which has such a [}] is called a unit. Some other classes [a] have the 
property that there is a class [b] such that [a][b] = [0]. A class [a] ¢ [0] which has 
such a [b] is called a zero divisor. 


Example 1.15 In Z/ =j0 the class [3] is a unit since [3][7] = [1] and [2] is a zero- 
divisor since |2|[5] = [0]. 


In fact, the following is true: 


Lemma 1.11 Every class [a] © Z/ =, not equal to [0] is either a unit or a zero 
divisor. 


Proof 1.16 Let [a] 4 [0]. Either gcd(a,n) = 1 or gcd(a,n) > 1. In the former case, 
by Lemma 1.7, we know then that |a] is a unit. In the latter case, set d = gcd(a,n) > 1. 
Since dla and d\n we may write a = dk and n = dl for some integers k,l € Z (note 
that | #0 since n £0). Notice that [a]{!] = [al] = [dkl] = [nk] = [0], since nk =, 0. 


Hence, we see that [a] is a zero divisor. 


Note that a Corollary to this result is the fact that every non-zero class in Z/ =, 
(where p is prime) is a unit. 


Lemma 1.12 /Fermat’s Little Theorem] Let a,p € Z with p a prime number and 
pta. InZ/ =, we have {al?~* = [1]. 


Proof 1.17 Since [a] # [0] we know [a] is a unit and so there is a class [b] with 
[a][b] = [1]. Consider the following list of classes: [1][a], [2||a],...,[p—1]|a]. First note 
that all the classes in this list are distinct, for if |r|[a] = |s|[a], then by multiplying 
on the right by |b] we have [r| = [s]. Furthermore, these classes are all not equal 
to [0], since if [r|[a] = [0], then again by multiplying on the right by [b] we have 
[r] = [0] which is not the case. Therefore, this list is simply a reordering of the classes 
[1], [2],...,[p—1] and so [1][a][2][a] - - - [p—1][a] = [1][2]---[p—1]. This can be rewritten 
as [a]?~1[2]---[p — 1] = [2]---[p—1]. Since [2],...,[p —1] are units, we can reduce 
this equation to [a]P~' = [1]. 


We also have the following result: 
Corollary 1.3 Let a,p € Z with p a prime number. In Z/ =p we have [a]? = [p}. 
An equivalent way to represent Z/ =, with its two operations of addition and 


multiplication is to consider the following structure. Let Z, = {0,1,2,...,2—1} and 
define addition, denoted by +, and multiplication, denoted by -,, for these elements 
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as follows: m+, k = r if using the Division Algorithm, m+ k = qn+r where 
0<r<_n. Similarly, m-, k = r if using the Division Algorithm, m-k = qn+r 
where 0 < r < n. There is a direct correspondence (which we shall call later an 
isomorphism) between Z/ =, and Z,. First off a class [a] € Z/ =, can be equated 
with a € Z, by Lemma 1.8. Furthermore, for 0 < a,b,c <n—1, [a] + [6] = [ce] iff 
a+,,b = cand [a][b] = [c] iff a-,b = c. We prove this fact for addition (multiplication 
is similar). Suppose first that [a] + [b] = [c]. This implies that a+ b =, c and so 
n|(a + b—c) which implies a+ b— c= ng or a+b=nq+c where 0 <c <n. Then 
by definition a +, 6 = c. Now assume that a+, 6 = c. By definition this implies 
a+b =gqn+c where 0 <c<_n. Thus, gn = a+b-—cand so n|(a+ b—c) which 
implies a + b =, c which is equivalent to [a] + [6] = [c]. 

As a result of this observation, all the properties we proved in this section about 
Z/ =n are equally true for Z, (with the square brackets removed from the statements 
of the properties). We will be using the structure Z,, from now on and dispensing 
with Z/ =n. 


EXERCISES 


1 Prove Lemma 1.10. 
2 Prove that [a] € Z/ =, has a class [b] such that [a][b] = [1] iff gcd(a,n) = 1. 
3 Consider Z/ =¢ 


a. Write out the addition and multiplication tables 


b. List the units and zero divisors 


4 List separately the units and zero divisors of Z/ =j9. Illustrate this explicitly 
for each class as we did in Example 1.15. 


5 Use Fermat’s Little Theorem to compute [4'°°]13. 
6 Prove that every non-zero class in Z/ =, (where p is prime) is a unit. 


7 Prove Corollary 1.3. 


CHAPTER O 


Basic Group Theory 


N THIS CHAPTER, we introduce the reader to a group structure. In Section 2.1, 
| we present basic definitions, examples and terminology related to groups. In Sec- 
tion 2.2, we define a subgroup, give many classic examples and present a shortcut 
for verifying subgroup. In Section 2.3, we present an important class of groups called 
cyclic groups. Permutation groups are an essential part of group theory and are pre- 
sented in Section 2.4. In Section 2.5, we create new groups and subgroups by means 
of a product. In Section 2.6, we introduce functions between groups called homomor- 
phisms. As a follow up, in Section 2.7, we use isomorphisms to define what it means 
for groups to be essentially equal (or isomorphic). In Section 2.8, we define perhaps 
one of the most equivalence relations on a group whose equivalence classes are called 
cosets, and these cosets can sometimes form a group. In Section 2.9, we investigate 
exactly when cosets form a group and look at several important examples. In Sec- 
tion 2.10, we investigate further notmal subgroups and consider groups which have no 
normal subgroups. Finally, in Section 2.11, we prove some fundamental isomorphism 
results for groups and factor groups. 


2.1. DEFINITIONS AND EXAMPLES 


In this section, we introduce the reader to one of the main topics under investigation 
in this text, namely a group structure, which is one of several algebraic structures we 
shall study. 


Definition 2.1 A group (G, *) consists of a set of elements G together with a binary 
operation * satisfying the following four axioms: 


Closure — For all g,h € G we have ge heEG. 
Associativity — For all g,h,k © G we have g * (hx k) = (g*h) *k. 


Identity — There is a special element e € G having the property that for all 
g€G we have gxe=g=exg. The element e is called the identity element 
of the group. 


Inverse — For every g € G there is anh € G such thatgxh=e=hxgq. The 
element h is called an inverse of g. 
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If in addition the group satisfies the commutative property, t.e. for allg,hEeG 
we have gx h=hxg, then we call the group an abelian group (or commutative 


group). 


We remark that this binary operation * can be viewed as a function from G x G 
into G, ie. *(g,h) = g *h for g,h € G. This observation will become useful in the 
discussion of well-definition of a binary operation. 


Example 2.1 Here are a list of some classic examples of a group. Note below that 
N* means the collection of numbers N without the number 0. 


1. 


10. 


11. 


12. 


(Z, +), the integers together with addition, forms an abelian group. The identity 
element is 0 and the inverse of n © Z is —n. In fact, Z can be replaced by Q, 
R or C to get other additive abelian groups. 


. (Z*,-) together with multiplication does not form a group, since only +1 have 


inverses in Z*. 


. (,-) forms an abelian group. In fact, Q* can be replaced by R* or C* to get 


other multiplicative abelian groups. 


. (Zn, +n) forms an abelian group. 
. (Zn) forms an (abelian) group iff n is a prime number. 


. For any set A, the collection of permutations of A, i.e. Sym(A), together with 


the operation of composition forms a (not necessarily abelian) group. 


An example of commutativity failing is A=R, f(x) =a2+4+1 and g(x) = 2°. 


Notice we have (f 0 g)(x) = 23 +1 while (go f)(x) = 2° + 327 + 3x +1. 


. The collection of real m x n matrices, written Mmy(R), with matrix addition 


forms an abelian group. 


. The collection of n x n real invertible matrices, written GL,,(R) and called the 


general linear group, with matrix multiplication forms a group. 


. The collection of n x n real matrices of determinant 1, written SL,(R) and 


called the special linear group, with matrix multiplication forms a group. 


The collection of n x n real upper triangular matrices with 1’s on the diagonal, 
written U,(R) and called the unipotent group, with matrix multiplication 
forms a group. 


The collection of all n x n real diagonal matrices, written Dp(R), forms an 
abelian group. 


Consider a square and let G consist of all rigid motions of the square which 
leave the square unchanged in look. There are exactly 8 such rigid motions 
which can be divided into two types of equal number: rotations and reflections. 
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1 2 


3 2 3 : Q 


Figure 2.1 At the top is the original square. The bottom left is a 90° rotation of the 
square while the bottom right is a vertical reflection. 


Lids 


There are the 0°, 90°, 180°, and 270° clockwise rotations, one horizontal, one 
vertical and two diagonal reflections. Figure 2.1 shows the square as well as a 
90° rotation and a vertical reflection. This group is called the dihedral group 
and is denoted by D4. One can easily generalize this definition to D, (forn > 3) 
which consists of n rotations and n reflections of a regular n-gon. 


Consider the set with 8 elements +1,+7,+j,+k and define an operation on 
this set satisfying the identities i? = 72 =k? =—-1, ij =k, jk=i, ki=j, 
(-1l)i=-i, (-1)j =—-Jj, (—1)k = —k. This group is called the quaternions 
and is denoted by Qs. 


Remark 2.1 Here are some remarks and some observations which are easily verifi- 
able from the group axioms. 


1. 


2. 


The groups D4 and Qs are distinct (what we shall call non-isomorphic) groups. 


There is only one identity element in a group. Indeed, if e and e’ were identities, 
thene = exe! =e’. From now on we will denote this unique identity by the 
notation 1 (not to be taken literally). 


Every element of a group has only one inverse. For if h and h' were inverses of 
an element g in a group, thenh = hxe = hx(gx*h’) = (hxg)*h'’ =exh' =h’. 
This allows us to introduce the notation g~' as representing the unique inverse 
of G. This notation is derived from the rational numbers, but should not be 
taken literally, i.e. g-' 4.1/g (in fact, it makes no sense whatsoever!) 


. From now on we will drop the * and simply write gh for g *h. 


. Oftentimes we will refer to a group (G,*) simply by its set G when the operation 


is either understood or completely arbitrary. 
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Figure 2.2 A group multiplication table. 


6. Notice that even though we are dealing with an abstract group operation, we 
employ suggestive multiplicative notation. This is merely our convention. We 
could have very well represented things in an additive way, such as using 0 for 
the identity and —g for the inverse of g. In fact, this is oftentimes used when 
dealing exclusively with abelian groups. 


7. We leave it as an exercise to check that our identity and inverse axiom could be 
weakened to merely a left identity and left inverse and we still have a group. 
In other words, the identity axiom could be simply eg = g and the inverse axiom 
could simply be hg = e. 


8. A group has the cancellation property, i.e. if g,h,k € G and gh = gk or 
hg = kg, thenh=k. 


Definition 2.2 The order of a group G, written |G|, is the cardinality (or size) of 
the set G. If G has a finite set of elements we called the group a finite group or a 
group of finite order. Otherwise G is called an infinite group. 


2.1.1. Groups of Small Order 


We begin what is called a classification result where we seek to list all possible 
distinct (to be defined) non-isomorphic groups with a certain property. In this case, 
the group property is its size. 

First, let us dispense with the trivial group G = {1} which has only an identity 
element and is the only group of order 1. In presenting this material we make use of 
what we call a multiplication table reminiscent of the multiplication tables used by 
children to learn their times tables. An example of such a table is given in Figure 2.2 
for a group with four elements 1, g,h and k. 

We fill in the empty boxes by cross referencing each element of G in the first 
column, say h, with an element of G in the first row, say k, and fill in the cross 
referenced box with h * k or simply hk (order is important) as we did in Figure 2.3. 
Note that since G satisfies the closure property, we know that hk must be one of the 
elements 1, 9, h, k. 
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Figure 2.5 The identity row & column of a group multiplication table. 


Example 2.2 Consider the group Z4 with addition (modulo 4). Its multiplication 
table is given in Figure 2.4. 


By the identity axiom the second row of any group multiplcation table must be 
identical to the first row and the second column of the table must be identical to the 
first column (see Figure 2.5). 

Another property of the table we can deduce using the cancellation property of 
G is that each of the elements of G must occur exactly once in each row and column 
(convince yourself of this). Hence, the rows and columns of G are permutations of the 
elements 1, 9,h,k (this type of table has been studied and is called a latin square). 
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Figure 2.7 The group multiplication tables for a group of order 4. 


If the table has symmetry across the diagonal starting at the upper lefthand corner, 
then G is abelian. With this in mind let us begin writing multiplication tables for 
small groups. 

Because of the conditions we just mentioned, there is only one way to complete 
a multiplication table for group of order 2 or 3 (see Figure 2.6). 

Note also that these two groups are abelian. Our conclusion then is that there is 
only one group of order 2 and only one group of order 3 and they are both abelian. 
When we look at groups of order 4 the situation becomes slightly more complex. 
One can show that there are only two possible distinct tables one can construct (see 
Figure 2.7). 

There are, in fact, other seemingly different tables, but one can argue that they 
are equivalent to one of these two tables (later we shall say they are isomorphic) by 
simply renaming the elements of the group. The two above are clearly different tables, 
since for instance in the second any element multiplied by itself is the identity, which 
is not true for the first table. Notice that both groups are abelian. Notice also that 
the first table is equivalent (isomorphic) to the table for (Z4,+4) if we rename 1 as 
0, gas 1, has 2 and kas 3. The other table is the multiplication table for what we 
call the Klein-4 group (named after the German mathematician Felix Klein). Our 
conclusion then is that there are exactly two distinct groups of order 4 and they are 
both abelian. We will explore higher order groups later on in the text once we have 
more machinery. 
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2.1.2 Group Exponentiation 


There is a natural way to define exponentiation for a group and we do so now. It is 
identical to how one defines exponentiation for real numbers. 


Definition 2.3 For any g € G and n a positive integer, 
fed 
‘= 4g, 
7 = 99g 


One can show using induction and cases that this definition of exponentiation sat- 
isfies the same properties as the usual exponentiation for real numbers. We summarize 
these properties in a proposition. 


Proposition 2.1 For all g € G and integers m and n we have 


2. (g™)-* = (g*)™ 
3. (gy = 9" 


A key notion in the study of groups is the notion of conjugacy. 


Definition 2.4 Given two elements g,h € G a group, we say g is conjugate to h, 
if there exists ana € G such that h = aga™!. It’s easy to check that conjugacy is an 
equivalence relation on G and so we may consider the equivalence classes associated 
with this relation, called conjugacy classes. Forg € G, 


[9] ={hEG : h=aga™ for someae G}={aga' : a€ Gh. 


EXERCISES 


1 Verify the group axioms for the structures given in Example 2.1, namely 


a. 


b. 


Z,+) is an abelian group. 


,+) is an abelian group. 


) 
) 
+) is an abelian group. 
+) is an abelian group. 
) 


-) is an abelian group. 


IR*,-) is an abelian group. 


iw) 


w 


aD 


“I 


Co 


Ke) 


10 


11 


12 
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g. (C*,-) is an abelian group. 
h. (Zn,-+n) forms as abelian group. 


i. For any set A, the set Sym(A) together with the operation of composition 
is a group. 


j. The collection of real m x n matrices with matrix addition is an abelian 
group. 


k. The collection of n x n real invertible matrices with matrix multiplication is 
a group. 


l. The collection of n x n real matrices of determinant 1 with matrix multipli- 
cation is a group. 


m. The collection of nxn real upper triangular matrices with 1’s on the diagonal 
and with matrix multiplication is a group. 


n. The collection of all real diagonal matrices is an abelian group. 
As was done in Figure 2.1, draw the remaining six elements of D4. 


Give a counterexample to illustrate why (Z*,-) together with multiplication 
does not form a group. 


Prove that (Z*,-,) forms an (abelian) group iff n is a prime number. 
Give an example of two invertible matrices which do not commute. 


Check that the identity and inverse group axioms could be weakened to merely 
a left identity and left inverse and we still have a group. In other words, the 
identity axiom could be simply eg = g and the inverse axiom could simply be 
hg =e. 


Show that a group has the cancellation property, i.e. if g,h,k € G and gh = gk 
or hg = kg, then h = k. 


Explain why in a group multiplication table each element of the group must 
occur exactly once in every row and column of the table. 


Write out the multiplication table for D4 and use it to verify that D4 is indeed 
a group. 


Write out the multiplication table for Qg and use it to verify that Qs is indeed 
a group. 


Using your work for the multiplication table of D4 and Qg explain why they 
represent different groups of order 8. 


Without using Proposition 2.1, prove that for any g € G a group we have 
=1\=1 
ie Mie. 
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13 Prove for all g,h € G a group we have (gh)"' = h-tg7!. 


14 Prove for all gi,...,9, € G a group we have 


(91-°'9k) = 9 GL 


15 Show that if G is a group and a,b € G with o(ab) = n, then o(ba) = n as well. 
16 Prove Proposition 2.1. 


17 Prove that conjugacy forms an equivalence relation on a group G (see Defini- 
tion 2.4). 


2.2 SUBGROUPS 


For every algebraic structure there is a notion of a substructure — a subset of the 
structure that preserves all of its axioms. For groups these substructures are called 
subgroups. 


Definition 2.5 Let (G,*) be a group. A non-empty subset H C G is a subgroup 
of G if (H,*) is a group. In other words, H together with the operation of G still 
satisfies the four axioms of a group. The notation to signify that H is a subgroup of 
GisH<G. 


Example 2.3 Here are some subgroup chains related to the examples we gave in the 
previous section. 


1. (Z,+) < (Q,+) < (R,4+) < (C,+4). 


Before we give any more examples we wish to point out some shortcuts for veri- 
fying when a non-empty subset is a subgroup of a given group. 


Lemma 2.1 Let H be a non-empty subset of a group G. 
1. H<G iff For all hy,ho € H we have hyhz' € H. 


2. If in addition H is finite, then H <G iff For all hy, ho € H we have hyh2 € H 
(in other words, it suffices to check the closure axiom for H). 


Proof 2.1 For the first part, assuming that H < G, take hi,ho € H. Since H 
satifies the inverse property we know that hs € H. And since H satisfies the closure 
property we have that hihy! € H. Now assume that for all hi,h2 € H we have 
hyhs' € H. We need to verify that H satisfies the four axioms of a group. To show 
the identity property, take any h € H (here we use the fact that H is non-empty). 
Then by assumption we have hh~! € H (set hy, and hy equal to h) — in other words 
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1é€H. To verify the inverse property, take any h € H. Then by assumption we have 
1h"! € A (set hy = 1 which we now know is in H and hg = h) - in other words 
h-' € H. To verify the closure property, take and h,k € H. Then by assumption we 
have h(k~')~' € H (set hy =h and hy = k~' which we now know is in H) - in other 
words hk € H. The associative property for H is simply “inherited” from the larger 
group G — in other words, when taking any three elements in H, since they are also 
elements in G, and G has the associative property, they must associate. 

For the second part, one direction is immediate: If H < G, then being a group it 
satisfies the closure property. Now assume that H satisfies the closure property. By 
the first part it suffices to show that H satisfies the inverse property. Take any h © H 
and consider the list h,h?,h>,... which is contained in H (by closure). Then some 
power of h must equal the identity. To see this, first note that since H is finite the 
list above must have repeats, i.e. there exist 1 <i < j such that h’ = h’. But then 
hi-*=1. Setm=j—-i. Ifm=1, thenh=1 and certainly has an inverse (namely, 
itself). Ifm>1, then h™*h=1 andh!=h™' EH. 


Example 2.4 Here are some additional examples of subgroups which we can now 
verify more readily using Lemma 2.1 


1. To verify that (Q*,-) < (R*,-) take any $,5 € @ (note that a,b,c,d # 0 and 
so ad, bc £0). Then 


ay coy ay fa ad x 

CloMe Cmts 
2. (nZ,+) < (Z,+), for any n € Z, since if nk, nl € Z we have (nk) + (—nl) = 
n(k—1) € nZ (note that in Lemma 2.1, hyhy' written additively is hy +(—h2)) 


3. Consider the group C* with multiplication and the subset 
H={2¢eC 2.2" = 1}, 


for some fixed positive integer n. Notice that H is finite, since it consists of all 
the roots of the polynomial x" — 1 which is a polynomial of degree n and so has 
at exactly n roots in C*. Therefore, to show H < C* tt suffices to show that H 
satisfies the closure property. Take 21, z2 € H, then (z22)" = zfzy =1-1l=1 
and so 2122 € H (here we use the fact that C* is abelian). 


Remark 2.2 Some of the remarks below will be left as exercises for the reader. In 
what follows G always represents a group. 


1. If A,K <G, then HONK <G. 
2. If H, K <G, then it is not necessarily the case that HUK <G. 


38. IfH,K <G, thenHJIUK<G iff Hither H < K or K < dH. 
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4. For any g € G the set H = {g"” : n € Z} <G, called the cyclic subgroup 
generated by g. 


5. Any group G always has the trivial subgroup, {1}, and the improper sub- 
group, G. 


6. Forg € G define 
Calg) ={heG : gh=hg}, 
called the centralizer of g in G. Then Co(g) < G. 


7. Take any subset X CG and define 
Ce(X) ={hEG : gh=hg, forallg eX}, 


called the centralizer of X in G. Then Cg(X) < G. Note, when X = {g}, 
Ce(X) = Ca(g) and when X = G we call Cg(G) the center of G and use the 
notation Z(G). 


&. Let X be a non-empty subset of a group G. The subgroup generated by X, 
written (X) is the collection of all finite products of elements of X and their 
inverses. The set X is called the generating set of (X). Then (X) <G 


9. If X = {91,92,---;9n} ts a finite set, then we write (g1,92,---,;9n) for (X). 
Note that if X = {g}, then (X) is simply the cyclic subgroup generated by g. 


10. One can show that 


e (X) is the “smallest” subgroup of G containing the set X, i.e. if H<G 
and H contains X, then (X) < H. 


e (X) is the intersection of all subgroups containing the set X. 


Sometimes it is convenient to present what is called a Hasse Diagram or Lattice 
of Subgroups of a given group. 


Example 2.5 We will illustrate the concept of a Hasse Diagram with two examples. 


1. Consider the group Zi with addition modulo 12. One can check that the only 
proper subgroups of this group are the following list of subsets: {0,2,4,6,8, 10}, 
{0,3,6,9}, {0,4,8}, {0,6}, {0}. The Lattice of Subgroups is presented as a 
graph with the subgroups being the vertices of the graph and there is a connection 
between two vertices if the set below is contained in the set above it. The Lattice 
of Subgroups for Zy2 is given in Figure 2.8. 


2. Consider the Quaternion group Qg = {+1, +i,+j,+k}. One can check that the 


only proper subgroups of Qg are {1}, {41}, {41,47}, {41,47}, {+1, +4}. 
The Lattice of Subgroups for Qs is given in Figure 2.9. 
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{ 0,4,8} 


—— { 0,3,6,9 } 
\ 
/ 


{0} 


Figure 2.8 The Lattice of Subgroups for Zy9. 


5 ™ 
{ 1,43 } em {414j7} >e{4l,4k} 
i od a 


{1} 


Figure 2.9 The Lattice of Subgroups for Qs. 


EXERCISES 


1 Verify subgroup for each of the statements in Example 2.3 using Lemma 2.1, 
namely 
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2 Verify subgroup for each of the following subsets using Lemma 2.1: 


a. H={f(c)=mr+b : meR*, bE R} < Sym(R) 


b. a=\|4 ] mer beR| <GhalR 


3 Decide whether or not each of the following non-empty subsets of a group G are 
subgroups. If so, carefully justify your answer. If not, provide a counterexample. 


a. G= Sym(N) and H={o0E€G : o(5) =5}. 

b. G=R x R with addition coordinate-wise and H = {(x,y)€G : y= 27}. 
4 Prove that the identity of a subgroup coincides with the identity for the entire 

group. 


5 Prove that the inverse of an element of a subgroup coincides with the inverse of 
this element for the entire group. 


6 Prove each of the statements in Remark 2.2, namely 


a. If H,K <G,then HNK <G. 


b. If H, kK < G, then it is not necessarily the case that H UK < G (i.e. give a 
counter-example). 


c. If H,K <G,then HUK <G iff Either H< Kor K < dH. 
d. For any g € G the set H={g" : nEZ}<G. 


e. The trivial subgroup and the improper subgroup of a group G are indeed 
subgroups of G. 


f. If g € G, then Ce(g) < G. 
g. If X CG, then Ce(X) < G. 
h. If X is a non-empty subset of G, then (X) < G. 


i. If X is a non-empty subset of G, then 


e (X) is the “smallest” subgroup of G containing the set X, i.e. ifH <G 
and H contains X, then (X) < H. 


e (X) is the intersection of all subgroups containing the set X. 


2.3 CYCLIC GROUPS 


In this section, we study an important family of groups called cyclic groups. Later 
on in the text, we will classify completely cyclic groups. 
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Definition 2.6 A group G is cyclic if there is an element g € G such that G = (q). 
In other words, every element of G can be expressed as an integer power of g. We 
call g the generator of G and we say G is generated by g. 


Example 2.6 We give a finite multiplicative example and an infinite additive exam- 
ple of a cyclic group. 


1. The group Z% is cyclic, since (for instance) it is generated by 3. Indeed, if we 
look at successive powers of 3 we see that all the elements of the group are 
attained: 


31=3, 37=2, 33=6, 3¢*=4, 3° =5, 38 =1. 


Note that higher powers of 3 will cycle through the same values in Z? and in 
the same order (hence the name cyclic group). 


2. The group Z with addition is cyclic, since it is generated by 1. Indeed, for any 
n€Z we have 1" =n, since 


1° =1, 1™=14+1+---+1=n (n>0) and 
—— ee 
joP {SD tat) pee el) = =m re 


Note that the generator of a cyclic group need not be unique. For instance, in 
this group —1 is also a generator of Z. 


8. Zn, with addition modulo n is cyclic with generator 1 once again. 


4. The Klein-4 group is not cyclic, since the square of any element in the group 
gives the identity. 


Remark 2.3 There is a connection between cyclic and abelian as is described below. 


1. Every cyclic group is abelian, since if G = (g), then for any g",g™ € G we 
have 


gg" =9 g gg”. 
2. However, not every abelian group is cyclic. Indeed, the Klein-4 group is (the 
smallest) example of an abelian group which is not cyclic. 


Definition 2.7 The order of an element g of a group, written o(g), is the smallest 
positive power of g that attains the identity in the group (should such a power of g 
exist), i.e. 


o(g) =k, if g* =1 and for allm, g™ =1 implies k < m. 


Equivalently, if for some0 <m < k we have g”™ = 1, then m = 0. If no such 
integer k exists we say the element g has infinite order. 
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Example 2.7 We explore the order of an element in several group settings. 


1. Referring Example tr, we see that o(3) = 6. 

2. In Zy2 with addition modulo 12, we see that 0(3) = 4 (exercise). 

8. In Z, every non-zero integer has infinite order. 

4. InGL,(Zp) with p prime and matrix multiplication modulo p being the opera- 


tion, the order of the matrix . ; | is p (exercise). 


Remark 2.4 The order of a group and the order of an element although different 
concepts are related as follows: For any g € G, o(g) = |(g)|. In other words, the 
order of an element is equal to the order of the subgroup generated by that element. 
To see this we show that if o(g) = k, then 


OStLgo ay hs 


Take any g” € (g). Using the Division Algorithm, divide n by k to getn = qk+r 
whereQ<r<k—1. Then 


grag ig ig alg =a 


Lemma 2.2 Let g be an element of a group G with o(g) = k. 
1. If g” =1, then kin. 


2. For any integer n we have that o(g”) = a 


Proof 2.2 To prove the first part, divide n by k to getn = qk+r whereO<r<k-1. 
Notice that g’ = g"—% = g"(g*)~4 = 1. Since k = o(g) it must be that r = 0 and so 
n = qk which implies that k|n. 

For the second part, set d = gcd(n,k) so that we need to show that o(g”) = k/d. 
Since d = gcd(n,k) we can express n = dx and k = dy for some integers x and y 
with gcd(x,y) =1 (see Exercise 11 in Section 1.3). First note that 


GCMea@ hearer, 


Second, suppose z is a positive integer such that (g")* = 1. Thus, g"* = 1 and 
so by the first part of the Lemma we know k divides nz. Hence, nz = kl for some 
integer |. Then dxz = dyl and by cancellation xz = yl and so y divides xz. Since 
gcd(x,y) =1 it follows that y divides z (see Exercise 10 in Section 1.3). Since y and 
z are both positive with y|z we have y < z, t.e. k/d < z. 


Example 2.8 We apply Lemma 2.2 to the group Z%. We’ve seen that 0(3) = 6. Since 
3 is a generator of the group, we can use 3 to find the order of any other element in 


the group. For instance, let’s find the order of 6. We have seen that 6 = 3°. Therefore, 
0(3) 6 6 
6) = 0(3°) = = —_=2. 
19) = 18) = ea(3,0(3)) ~ god(3,6) ~ 3 


Indeed, notice that 6 = 1 modulo 7. 
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Corollary 2.1 If g € G generates G and G is a finite group, then g” generates G 
iff ged(n, o(g)) =1 


Proof 2.3 This is an immediate consequence of Lemma 2.2. 


Example 2.9 By Corollary 2.1, if we find one generator of a group, then we can 
easily find all of them. Consider the group Zi,. One can check that 2 generates the 
group. By Corollary 2.1, 2” generates the group iff gcd(n,10) = 1. Thus, n = 3,7,9 
and so the other generators of the group are 22? = 8, 2° =7 and 2° =6. 


Theorem 2.1 Let G be a cyclic group. 
1. Any subgroup of G is also cyclic. 


2. If G is finite and a positive integer d divides the order of G, then there exists 
an element of order d and a unique subgroup H < G of order d. 


Proof 2.4 We’re given that G = (g) for some g € G. For the first statement, suppose 
that H <G. If H is trivial, then it is generated by 1. Otherwise, there exists positive 
powers of g which are in H. To see this, since H is non-trivial there exists an m 4 0 
such that g™ € H. Since H is a group we know that g-™ = (g™)~! € H. Certainly 
one of m and —m is positive. Let k be the smallest positive power of g such that 
gi eH. 


Claim 2.1 H = (g*). 


The fact that (g*) C H follows since g*§ € H and H has the closure property. 
Now take any h € H. Sinceh € G we can write h = g™ for some integer m. Using 
the Division Algorithm, we have m = qk +r where 0 <r <k. Now g” = g”™-% = 
g™(g*)~4 € H. Since k is smallest positive integer with g* € H, it must be that r = 0. 
Hence, m = qk and so g™ = (g*)4 € (g*) which proves the claim. 

For the second statement, set |G| =n < co and we're given that d divides n. Set 
H = (g"/*). Then H is a subgroup of order d, since 


iia o(g) n Lee 


)= gced(o(g),n/d)  gcd(n,n/d) — n/d 

Suppose K is another subgroup of G of order d. By the first statement, K is cyclic 
and must be generated by some power of g. Set K = (g™). We know 1 = (g™)4 = g™ 
which implies n divides md. Thus, md = nl for some integer | and so m = (n/d)l. 
Therefore, g” = (g"/*)! € H and thus K = (g™) C H. But H and K have the same 
order d and therefore H = K. 


o(g 


Remark 2.5 1. Theorem 2.1.it is not true for an arbitrary group. For instance, 
the Klein-4 group does not have a unique subgroup of order 2, but rather it has 
three such subgroups. Although we will not study these groups until the next 
section, for the record we state that the alternating group A, for n > 5 which 
has order n!/2 has no group of order n!/4 even though n!/4 divides n!/2. We 
will not be able to verify this statement until much later when we show these 
groups are simple (to be defined). 
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2. In general, finding all the subgroups of a given group is no easy matter, however 


in the case of cyclic groups, Theorem 2.1 makes the task quite easy to do. 
Simply generate all the cyclic subgroups and isolate one of each order dividing 
the order of the group. Refer back to the lattice of subgroups of Zz and we see 
that its distinct subgroups are generated by the elements 0, 1, 2, 3, 4 and 6 of 
corresponding orders 1, 12, 6, 4, 8 and 2. 


EXERCISES 


1 


2 


Verify all the order statements made in Example 2.7 


For the group (Z, +), verify that 


a. —1 is another generator of Z 


b. Any other integer not equal to +1 is not a generator of Z. 


Compute the order of every element in Zj2 with addition (modulo) 12. 
Consider the group Z? with multiplication modulo 7. 

a. Show that 3 generates the group. 

b. Use part (a) and Corollary 2.1 to find the order of 2. 

c. Use part (a) and Corollary 2.1 to find all the generators of Z7. 


d. Write out the lattice of cyclic subgroups of Z-. 


Consider the following group with operation being matrix multiplication modulo 


2: 
a={]° : eben Za & ad + be 0 (nod 2) 


a. List the elements of G. 


1 1 
b. Compute H= (| j |): 


-1 
1 0 
c. Compute i | ; 


Prove that in GL,,(Z,) with p prime and matrix multiplication modulo p, the 


order of the matrix : ; | is p (you will need a proof by induction to help 


verify this result). 


Find all the generators of the cyclic group Zj, by finding one by hand and then 
applying Corollary 2.1 to find the rest. 
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8 Prove or disprove the following statement: For any prime p the multiplicative 
group Z,, is generated by 2. 


9 Find all the generators of the cyclic group Z3 by finding one by hand and then 
applying Corollary 2.1 to find the rest. 


10 Create the Lattice of Subgroups for the cyclic group (Zego, +60) making use of 
Remark 2.5. 


11 Create the Lattice of Subgroups for the cyclic group (Zé@,,-61) making use of 
Remark 2.5 (hint: first find a generator of Zé,). 


12. Prove Corollary 2.1 


2.4 PERMUTATION GROUPS 


We focus now in more detail on the symmetric group. This group constitutes one of 
the origins of group theory and is essential. Recall that for any set A the symmetric 
group, Sym(A), is the collection of all bijections from A to A (called permutations 
on A) with the operation being the composition of functions. We narrow our focus to 
finite sets A. In fact, one can narrow further to set A = {1,2,...,n} and the corre- 
sponding symmetric group S,. The reason why we can do this is essentially because 
any permutation on A = {ay, d2,...,@,} corresponds to a unique permutation in Sj. 
For instance, if o € Sym(A) and o(a;) = aj, then the corresponding permutation in 
S,, would send i to 7. As we keep promising, this notion of two groups being essen- 
tially the same will be made more formal later in the text when we define isomorphic 
groups. 

We now introduce notation for representing permutations. The first representation 
takes the form 


Example 2.10 In Ss if o(1) = 2, o(2) =4, o(3) =5, o(4) =1 and o(5) = 3, 
then 


Consider the permutation 


2 el, Bee 3B 
PSN 20 Sela Dey ay 2 
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For brevity we will write ot for aot. When computing the composition, in this 
text, the reader should note that one reads the maps from right to left just as we do 
with composition of functions. Hence, 


4 5 123 45) {123 45 
1 3 ya ess vane les oe ae -  s an 


The second way to represent permutations is by means of k-cycles. A k-cycle in 
Sy, is a special permutation represented as follows: 


3 
5 


OS te. Pe Ug) whe»re [Gye seay teh Gly. Zoaseg Wh: 


Note that, in this text, k-cycles are read from left to right(!). Thus, o is defined 
as follows: 
o(i1) = 19, a(i2) = 13, see o(ix—-1) = Uk, o(tx) = 11. 


Ifm ¢ {t1, 12, ..., a}, then it is understood that o(m) =m, i.e. any other m is 
fixed by a. A 2-cycle is also called a transposition. 


Example 2.11 In S7, ifo = (2436 1), then 
—f123 45 67 
oh ed: Gig. al el 
To find the inverse of a k-cycle simply reverse the order of the numbers in the 


k-cycle. For example, o~' = (1 6 3 4 2). Note that a transposition therefore is its 
own inverse, since 


(a b)"' = (ba) = (a). 
Ifr =(25 4)(4 15 2)(6 5 1 2 4) a composition of three cycles, then 


2 
CSV .9> 4 


Definition 2.8 Two cycles are disjoint if they have no numbers in common. For- 
mally, if o = (a, a2 +++ Gm) and tT = (b; bg --+ by), then AN B = O where 
A= {ay, AQ, «66, Gay and B = {by, bo, ene bn}. 


3 
3 


Lemma 2.3 Disjoint cycles commute, i.e. ifo,7 © Sp are disjoint cycles, then oT = 
TO. 


Proof 2.5 Let o = (a1 a2 +++ Gm) and T = (by be +++ bn) be disjoint cycles with 
sets A and B as in the definition. 


Case 1: Ifc ¢ AUB, then 
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Case 2: Ifa; € A, then a;,0(a;) ¢ B, and thus 


Case 3: If b; € B, then b;,7(bi) ¢ A, and thus 
o(r(b;)) = (bi) = T((bi)). 


Since ot and To agree on every input, they are therefore equal. 


Example 2.12 Consider the following permutation in S41: 


We shall illustrate with this example the result we wish to prove next, namely 
that every permutation can be written as a product of disjoint cycles. We will do this 
systematically so that a general algorithm is evident and can be used in the proof to 
follow. 

Start with 1. Notice that o sends 1 to 3. Then o sends 3 to 4, 4 to 7, 7 to 2, and 
a sends 2 back to 1. So a cycle ino is (1 34 7 2). Now pick the smallest number 
not mentioned in the cycle we constructed, which would be 5. Now o sends 5 to 8, 
which is sent to 6, and 6 is sent back to 5. Hence, a second cycle in a is (5 8 6). The 
smallest number not yet mentioned in both cycles constructed is 9 which is sent to 
11, which is sent back to 9. Hence, a third cycle in o is (9 11). The last number not 
yet mentioned is 10 which is sent to itself. This yields a 1-cycle (10). Therefore, 


(1 347 2)(5 8 6)(9 11)(10). 
Typically, we drop any 1-cycles from the representation and simply write 
(1347 2)(5 8 6)(9 11). 
Theorem 2.2 Every permutation can be written as a product of disjoint cycles. 


Proof 2.6 Let ¢ € S,. Note that for a positive integer k, o* means the k times 
composition of o, and if k = 0, then o° is the identity map. Consider the following 
infinite list: 

i Wat od C10) Pao al Sarak lS 


Since for any k we know o*(1) € {1,2,...,n}, then there are surely repeats in the 
above list. Let r be smallest such that o"(1) is a repeat of an earlier value in the list. 


Claim 2.2 o'(1) =1. 

By assumption there is a k with0 < k <r and o"(1) = o*(1). Suppose to the 
contrary that k > 0. Then o’*(1) = 1 contradicting our assumption that o"(1) is 
the first repeat. Hence, k = 0 and o"(1) = 0°(1) =1 which proves the claim. 
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Therefore, by the claim, one of the cycles in o’s representation as a product of 
disjoint cycles is 
(Co) ede Dea ys 


Certainly, r <n. Ifr =n, then we are done with the proof with o being rep- 
resented by a single cycle. Otherwise, choose the smallest number, say i, not in the 
list 

i od (6 Wat oan C10 Pie as ib Reman alae (IE) 


A similar argument to the one above shows that, for some positive integer s, 
another one of the cycles in o’s representation as a product of disjoint cycles is 


(i o(i) o°(4) o°(i) --- oH). 


Repeat this process of producing cycles until all of the numbers 1,2,...,n are used 
up. We leave it as an exercise to verify that we have produced disjoint cycles. 


Remark 2.6 The representation of a permutation as a product of disjoint cycles is 
unique up to the order of the cycles (since they commute). Although this statement 
might be intuitively obvious, a formal proof should really be provided, however we will 
skip this result. 


Definition 2.9 The cycle type of a given permutation is the length and number of 
cycles in its unique disjoint cycle representation. 


Example 2.13 Consider the permutation in the previous example where the disjoint 
cycle representation was 


o = (1347 2)(5 8 6)(9 11)(10). 


The cycle type for o can be expressed as (*)(**)(* * *)(* * * * x). Typically, the 
cycles are arranged in increasing order of length. 


Theorem 2.3 Every permutation can be written as a product of transpositions. 


Proof 2.7 To see this result, since we already have Theorem 2.2, it suffices to show 
that every cycle can be written as a product of transpositions. Therefore, consider an 
arbitrary k-cycle (a, az -++ , ag). Then 


(ay, dq +++ Gp) = (Gp—1 Az) +++ (G2 Gp)(ay ag). 
Hence, we expressed any k-cycle as a product of transpositions. 
Example 2.14 Consider in S7 the 5-cycle (2416 7). Then 
(24167) =(6 7)(1 7)(4 7)(2 7). 


Now unlike the previous representation as a product of disjoint cycles, the repre- 
sentation of a permutation as a product of transpositions is not unique. Indeed, in 
the previous example one can add two transpositions of the form (1 2) to the end of 
the representation, i.e. 


(24167) =(67)(1 7)(4 7)(2 7)(1 2)(1 2). 
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For that matter we could add four such transpositions at the end, or six, or eight, 
ad infinitum. Although there is no unique representation as a product of transposi- 
tions, there is something that remains invariant with respect to the permutation. 


Definition 2.10 A permutation is called even if it can be represented as a product 
of an even number of transpositions. A permutation is odd if it can be represented 
as a product of an odd number of transpositions. 


Theorem 2.4 A permutation cannot be both even and odd. 


Proof 2.8 The majority of this result is taken up with the fact that the statement 
is true for the identity map. We show that the identity map is only even, t.e. if 
1 = 772-+-T, where each 7; is a transposition, then k is even. This we do now. For 
each number m which appears in the transpositions consider the following reasoning: 
Let j be largest such that m appears in T;. Note that 7 4 1 for otherwise m would 
not be fixed by the identity map. There are four possibilities for the product T;~17;: 
(m x)(m x), (max)(m y), (« y)(m x) or (y z)(m a). For the first possibility, notice 
that we can simply remove the two transpositions from the product. For the other 
three properties, we can rewrite the product so that m appears first in T;1 as follows: 


(m x)(m y) = (m y)(z y), 


(x y)(m x) = (m y)(z y), 
(y z)(m x) = (m z)(y 2). 


In summary, we either remove two transpositions from the product or we move 
the last occurrence of m to the left. In the first possibility repeat the process on the 
next largest occurrence of m (if it exists). In the other three cases repeat the process 
on Tj-2T;-1- Note that for the number m you must eventually be in the first case, 
for otherwise m would appear first in Tt, which as we have already pointed out is not 
possible. Therefore, ultimately we remove m and eventually all transpositions from 
the identity map in a two-by-two fashion until none are left. Hence, the product must 
be comprised of an even number of transpositions. 

Now suppose that o is any permutation and ao = 7179-++T, and 0 = T{Th+++Th 
represented in two ways as a product of transpositions. Equating the two representa- 
tions we have TT2+++Tp = T1T4-++T, and moving them all to one side by multiplying 
by the inverses of the transpositions we have 

1l=T,-:: T2T1T1T) vee ee 
By the work above we know that r+ s must be even, but this implies that either r and 
s are both even or r and s are both odd. 


Definition 2.11 The alternating group, written A, = {0 € S, : o is even}. 
The set On, ={ao€S, : o is odd}. 
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Remark 2.7 We wish to point out several observation regarding Ay. 


1. As suggested by the name, A, is a subgroup of S,. To see this, since Ap is finite, 
it is enough to show that A, satisfies the closure property. But this is apparent, 
since if 0 = 71T2°++T, is a product of an even number of transpositions and so 
is T = 717-+-7h, then 

OT = T1TQ ++ TpT{T3 ++ Tey 


with r +s also even. 
2. By Theorem 2.4, we know that S,, is a disjoint union of An and On. 


3. Exactly half of the permutations in S, (n > 2) are even. To see this it is enough 
to show that |A,| = |O,| which we do by defining a bijection between the two 
sets. Define a function f : An + O, by f(o) = o(1 2). Certainly f maps 
into Oy since it adds one transposition to the end of any even permutation, 
thus making it odd. The function is one-to-one, since if f(o,) = f(o2), then 
o1(1 2) = o9(1 2) and by cancellation in the group S, we have o1 = 09. Finally, 
f maps onto O,, since for any T € O, notice that f(7(1 2)) = 7(1 2)(1 2) = 7. 


We leave as an exercise the following result: 
Lemma 2.4 In the permutation group Sy, 
1. the order of a k-cycle is k. 


2. if o equals a product of disjoint cycles, say 0 = 0102°+:Om, then the order of 
a is the least common multiple of the orders of 01, 02,...,0m- 


Example 2.15 We shall illustrate Lemma 2.4 with several examples. 


1. Consider the permutation 


123 45 67 8 9 
c= (3 5438791 g ) = 125818 467 9 


Therefore, o(a) = lem(4, 2,3) = 12. 


2. Leto = (1 3 7)(2 7 3)(1 4) € S7. Now o is not represented as a product of 
disjoint cycles, so we cannot yet use the second part of Lemma 2.4. We first 
will need to rewrite o as a product of disjoint cycles. One can compute 


Therefore, o is a 4-cycle and as such its order is 4. 


Basic Group Theory @ 47 


3. Using cycle types, we are now in a position to compute the orders of all the 
elements in S, (for a given n) and the number of elements of each order. Let’s 
do this for Ss: 

Cycle Type Order Number 


(+) (4) (4) 20 y= hb 
(x) (x) (* * *) 3 a5 = 20 
(x6) (% 5) 6 25 = 20 
(*) (* * 4x) 4 + = 30 
(«3 & *) 5 2 = 24 


Some explanation is required to understand how the cycle types decided and 
were counted. To determine the cycle types, simply consider all different ways 
(up to commutativity) of expressing 5 as a sum of positive integers (these are 
called the partitions of the number 5). There are two things to consider. First, 
there are k different ways to represent the same k-cycle. To see this, consider 
the example 3-cycle 

C23) =0 12) =125:1) 


Second, if the cycle type contains m disjoint k-cycles, then there are m! ways to 
order them all of which yield the same result (since they are disjoint). Now let’s 
count the cycle type (*)(**)(**). There are 5! ways to fill in the asterisks. Each 
2-cycle can be represented in two ways, thus we divide by 2 twice. Furthermore, 
the two 2-cycles can be ordered in 2! different ways, thus we also divide by 2!. 


Example 2.16 We now make the connection between the dihedral group and the 
symmetric group. 


1. For n = 3, the dihedral group consists of three rotations and three reflections 
of an equilateral triangle. Label the three vertices with the numbers 1, 2 and 3 
(see Figure 2.10). 


The rotations 0°, 120° and 240° will be denoted respectively by po, pi and po. 
The reflections will be denoted by 4, 2 and pig where for i = 1,2,3 the re- 
flection pi; fixes vertex i and swaps the other two vertices. So we can consider 
the elements of D3 as permutations of the numbers 1,2,3. Then the elements 
of D3 are 


_f1 23 Sak os 2 Fat 83 
ae a ee ae a a a ae 
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2 3 


Figure 2.10 Symmetries of the triangle. 


af hoe 3 Lay dy 3 e238 
fs aay Oi es aa ee a Be NOS Bae 


Note first that |D3| = |S3| and so forn = 3 we have D3 = S3. Let’s decide 
which elements are even and which are odd. We know po ts even and 


pi = (1 23) = (2 3)(1 3) p2 = (1 3 2) = (3 2)(1 2) 
fi = (2 3) bg = (1 3) fig = (1 2). 


Thus, we see that the rotations are all even and the reflections are all odd and 
so Az ={po, pi, pat. 


2. For the group D4 we label the vertices of a square with the numbers 1 through 
4. Thus, the elements of D4 are 


if ee 3s A ey fae eae 
saa De Sa | on ee ae 


4 


D> 
i) 

lI 
——N 
wre 

bo 
re Ww 
NO we 
So 
SS 
o 
lI 
a 
ey Re 
re be 
Dw w 
QC 
Ne 


ft LOR a fh: BoB 
BES Ny Ae Be by eel Oe ee 
set oes PL a4 
BES 819 of a a ie ae a 


Note that for n = 4 the subscripts of the pu; do not have the nice relationship 
to the vertices as they did for n = 3. Furthermore, the reader can check that 
half of the rotations are even and half are odd. The same holds true for the 
reflections. Since |D4| = 8 and |S4| = 4! = 24, Dy, is a proper subgroup of S4 
and since |A4| = 12 and D4 has only four even permutations, these four form 


a proper subgroup of Ag. 
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&. For any n > 3 there are several general statements we can make. First of all 


since |D,| = 2n and |S,| = n! it is only for n = 3 that the two groups coincide. 
Otherwise Dp is a proper subgroup of Sy, and the even permutations in Dp form 
a proper subgroup of An. Secondly, It is always the case that the rotations are 
generated by py. 


EXERCISES 


Oo CO NN DOD oO K 


Explain why the size of S;, is n!. 


Consider the following permutations in S¢: 


Compute ot, Ta, 0! and 71. 
b. Express each of o and 7 as a product of disjoint cycles. 
c. Express each of o and 7 as a product of transpositions. 


d. Decide if each of o and 7 is even or odd. 


Consider the permutation group So and the following two elements: 


,-(12345 6789 10 
TAY de De 8) SB TOR. Os fe 2G 


F=(1 245 6)2 68) 
a. Write o as a product of disjoint cycles. 

b. Use part a to compute the order of a. 

c. Decide whether a is even or odd. 

d. Compute o7 and write your answer in the same form as a was given. 
Verify that the cycles produced in Theorem 2.2 are disjoint. 

Prove Lemma 2.4. 

Apply Lemma 2.4 to find the order of o in Example 2.12. 

In S11, apply Lemma 2.4 to find the order of (1 3 2 6 8 11)(5 7 9 10) 


Decide which of the elements in D4 are even and which are odd. 


For 0,7 € Sy, prove the following statements: 


T(a1 a2 +++ a~)t + =(rT(a1) T(az) +--+» T(ag)). 


b. o and tor~! have the same cycle type. 


c. Cycle types correspond exactly to conjugacy classes for the group S,,. Hence, 
the table we constructed earlier for Ss is, in fact, the table of conjugacy 
classes for Ss. 
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2.5 PRODUCTS OF GROUPS 


In this section, in two different settings, we will construct new groups from old ones 
in each case by means of a product (to be defined). In the first case we will consider 
two arbitrary groups and in the second case we will consider two subgroups of a given 


group. 


Definition 2.12 Let (G,*) and (G’, x’) be two groups. The (external) product of 
G and G’, has as its objects the elements of the Cartesian product of G and G’, 


Gx G={(9,9') : g EG, J eG}, 
and has as its operation the coordinate-wise product, 


(91, 91) (92; 95) = (91 * 92,94 x! 92). 


One can easily generalize this construction to any finite number of groups. For 
brevity we replace G x Gg x --+ x Gy by 


n 
[[ Gi = {la 92,---, 9n) : ore Gi, go © Ga, 20) G9n © Gy}. 
i=l 


Example 2.17 Consider the external product Z4 x Z= x Qg. For illustration we 
compute the product (2,3, k)(3,4,7) = (2+43,3-74,kj) = (1,5, -2). The identity in 
this group is (0,1,1). 


Proposition 2.2 The order of (91, 92,---,9n) € Tf, Gi is the least common multi- 
ple of the orders of the individual g; € G; fori =1,2,...,n. 


Proof 2.9 We shall prove it for two groups and the result easily generalizes by in- 
duction on n. Let (g,g') € G x G’ and set m = o(g), n= o(g') andl = Iem(m,n). 
Thus, | = ma andl = nb for some a,b € Z. First of all, 


(9, 9')' = (9', (9')") = (C9), ((9')")") = 1, 1”) = 2,1). 
Now suppose that for some positive integer we have (g,g')" = (1,1’). Then 
(g’, (g/)") = (1,1) and so g” = 1 and (g')"’ = 1’. By Lemma 2.2.1, it follows that 


m and n both divide r. In other words, r is a common multiple of m and n and so 
by definition the least common multiple 1 <r. 


Example 2.18 We illustrate Proposition 2.2 with several examples. 
1. The order of (2,4,i) € Za x Z x Qg = Icm(2, 3, 4) = 12. 


2. Consider the group Z4 x Ze. Since both groups are cyclic, Z4 has elements 
of orders 1, 2 and 4 and Ze has elements of orders 1, 2, 3 and 6. Since in 
G x G’ we have o(g,g') = lem(o(g), o(g')) this group has elements of orders 
1, 2, 3, 4, 6 and 12. Note, therefore, that Z4 x Ze is not cycle since it has 24 
elements. 
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Now we present the second construction promised in this section. 


Definition 2.13 Let (G,*) be a group and X,Y two subsets of G. The product set, 
written 
XY ={xuxy : cEX, yEY}. 


If X = {x} we write cY for {x}Y. 
Proposition 2.3 For any two finite subgroups H, K of a group G, we have 


_ IAMIK 
|JHN K| 


| A 


Proof 2.10 Define the following relation on Hx K: (h,k) ~ (h',k’) iff hk =h'k’. 


One can easily verify that this is an equivalence relation. 
Claim 2.3 The class of (h,k), i.e. [(h,k)] = {((he7, zk) : 2 € HONK}. 


First note that (hz—!,zk) € [(h,k)], since hz-'zk = hk and so (hz71,zk) ~ 
(h,k). On the other hand if (h'k’) ~ (h,k), then hk = h'k’ and so (h')~th = k’k7}. 
Sei z=(h)h=kKk + GA K so that (hk) = (hz, zk). 

The Claim shows that the equivalence classes all have the same size as HN K. 
Therefore, the size of the quotient set (H x K)/ ~ is the size of H x K divided by 
HK. Now consider the map 


fi (Hx K)/~—+ HK by f({(h,k)]) = hk. 
This map is well-defined and one-to-one, since 
f(a, k)]) = F(R RD) @ hk = hk! & (hk) ~ (WR) @ [(h, k)] = [(h', F)].- 
Now this map certainly maps onto HK and so 


JZ x K| _ |AIIK 
IHN K| |HnkK| 


|HK| = |(H x K)/~| = 


Example 2.19 Now it is not always the case that for H, K <G that HK <G. For 
instance, take G= S3, H = (41) and K = (p2). Then 


AK S {Po, fi, H2; Hi pla $ = {Po, [1, b2, pr}. 


Then HK is not a subgroup of G, since for instance papi = po € HK. Indeed, 
the fact that HK #4 KH is the reason why it is not a subgroup, as we shall see in the 
neat result. 


Theorem 2.5 For subgroups H, K of a group G, the following are equivalent: 


1HK<G. 
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2 HK =Kd. 


8 KH<G 


Proof 2.11 It’s enough to show that the first two statements are equivalent, since 
the second statement is symmetric. First assume that HK < G. Take kh € KH, 
then kh = ((kh)~1)-1 = (h-'k1) 1 € HK, since HK < Gandh'k"! € HK. Now 
take an hk € HK. Note that k-*h~! = (hk)! € HK, since HK < G. So we can 
write k-'h-1 = h'k' © HK. Then 


hk = ((hk) ) t= (k 1, *) es ed ae oe c KH. 


Now assume that HK = KH. Take hk,h'k' © HK. Note that h'(k’/k~!) € HK = 
KH, so we can write h'k’'k7! = k"h" Then 


(WR (hk) = WR On = ka" 6 KA = AK. 
Remark 2.8 A few remarks are in order for the product HK. 


1. HK is called the internal product of H and K, since the construction occurs 
within the group G, whereas the external product of two groups yields a new 
group which lies outside both of the two groups in its construction. 


2. An immediate Corollary to Theorem 2.5 is that in an abelian group the product 
HK is always a subgroup of G. In this case the product is often represented as 
a sum, t.e. H+ K, and is called the internal sum. 


Example 2.20 We give two examples of an internal product/sum. 


1. Consider the multiplicative group G = Zig with subgroups H = (4) = 
{4, 16, 7,9,17,11,6,5,1} and kK = (8) = {8, 7,18, 11, 12,1}. Note that HONK = 
{7,11,1}. Since G is abelian we know that HK is a subgroup of G and the size 
of HK, by Proposition 2.8, is ae = 18. Thus, we know, in fact, that HK =G. 


2. Consider the additive group G = Z with subgroups H = mZ and K = nZ. We 
show that the internal sum H+ K is the subgroup dZ where d = gcd(m,n). So, 
for instance, 4Z+6Z = 2Z. To see this takex € H+ K and write x = ma+nb. 
Since d divides both a and b, then d divides ma+nb = x. Hence, x = dy € dZ. 
Now take x € dZ and write x = dy. We know there exists integers xo and yo 
such that d= mxp + nyo and so 


x = dy = m(yxo) + n(yyo) € H+ K. 


Definition 2.14 Let G be an abelian group. 


1. When H,K < Gand HN K =0 (by this we mean HN K = {0}), we write 
HOK instead of H+ K to signify this fact and it is called an internal direct 
sum of H and Kk. 
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2. More generally, if H,,Ho,...,Hy, < G and for each i = 1,2,...,n we have 
A; (A, +--+ + Ai-1+ Hi4i +---+ An) = 0 we write HO H,®---@ Ay 
instead of Hy + H2+---+ Hy to signify this fact and it is called an internal 
direct sum of H,, Ho,..., Hn. 


Lemma 2.5 The following are equivalent for G an abelian group with subgroups 
A, Flo, See , Ay: 


1.G=H,@H2e::-@®Hy 


2. For every g € G there exist unique hy € Hy, ho © Ho, ..., hyn € Hn such that 
g= hy that thn. 


Proof 2.12 We show that the first statement implies the second. Let g € G. The 


existence of hy © Hy, ho © Ha, ..., hn € Hn such that g = hy thot+---+hn follows 
immediately, since we know G = H, + H2+---+H,. To see uniqueness, suppose in 
addition g = ky +ko+---+kn for some ky € Hi, ko © Ho, ..., kn € Hn. Then for 
each 1 = 1,2,...,n we have 


kz — hy = (ha — by) +++ aa — Wega) + Chat — hg) +++ + (an — Ap), 


which is in H,O (Ay +---+ Ajy-14+ Higa +--+: + An) = 0. 

Therefore for each i = 1,2,...,n we have hy — ky = 
uniqueness. 

Now we show that the second statement implies the first. The fact that G = 
A, + Ho---+ H, follows immediately. Thus, it remains to be proved that for each 
i = 1,2,...,n we have Hj (A, +--- + Ay-1 + Aina +--+ + An) = 0. Suppose 
g€ A,(A4+--- 4+ Ai-14+ Aigit::-+ An. Now g can be represented as an element 
in H, + Ho +---+ Hp in two different ways. 


0 or hy = ky which proves 


g=0+4+---+0+9+0+---+0, where g is in the ith position, and 


gahate thir +04 higi +--+ +2, 


for some hy € My, +--+, hi-a € Ai-a, hisn € Higa, .--, hn € Hn. By unique 
representation we can equate the corresponding entries in each of the sums, so in 
particular we get g = 0. 


EXERCISES 


1 Verify that the external product of any two groups is again a group. 


2 Using induction and the proof in the case of two groups, prove Proposition 2.2 
in general. 


3 Consider the group D4 x Qs. 


a. Compute the orders of all the elements in D4 and Qs. 
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b. Compute the order of (1,7). 


c. What are the possible orders of elements in D4 x Qs? 


4 Verify that the relation defined in the proof of Proposition 2.3 is an equivalence 
relation. 


5 Consider the multiplicative group G = Zj, with subgroups H = (4) and K = 
(6). 
a. Compute the elements of H, K and HN K. 
b. Compute the size of HK and decide if HK =G. 


6 Let G be an abelian group and H, K < G. Prove that if gcd(|H|,|K|) = 1, then 
H+K=HOK. 


7 Let G bea group and H, kK < G. Suppose every element in G can be expressed 
as a product hk, wherehe€ H andke K. 


Prove that if HM AK = {1}, then every in G can be expressed uniquely as a 
product hk, i.e. for all g € G there exists a unique h € H and unique k € Kk 
such that g = hk. 


2.6 HOMOMORPHISMS 


Every algebraic structure has its associated functions which in some sense respect the 
algebraic operation(s) of the structure, and groups are no exception. In this section, 
we explore these functions and provide ample examples to illustrate how they work. 


Definition 2.15 Let (G,*) and (G’,*’) be two groups. A function 6: G—> G' is a 
group homomorphism if for all g,h € G, we have 


o(g *h) = o(g) *' o(h). 


One says that the function respects the group operations. In more words, it says 
that the image of the product of two elements is the same as the product of the two 
images. 


Example 2.21 We give several examples of a group homomorphism with their ver- 
ifications. 


1. Consider the groups (R*°,-) and (R,+) (note that R*° signifies positive real 
numbers). The map @ : R*° + R by ¢(a) = Ina is a group homomorphism, 
SINCE 


@(ab) = In(ab) = Ina+1nb= (a) + (0). 


2. Consider the groups GL,(R) with matrix multiplication and (R*,-). The map 
@:GL,(R) > R* by $(A) = |A| is a homomorphism, since 


P(AB) = |AB| = |Al|B] = 9(A) (8). 
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We point out that @ maps into R*, since our inputs are invertible matrices and 
as such have non-zero determinant. 


. Consider the additive groups Z and Z/ =n. The map @: Z > Z/ =n by 


o(k) = [k]n is a homomorphism, since 


o(k +m) = [k+ mn = [K]n + [Mn = O(k) + O(m). 


. Consider the multiplicative groups C* and R*°. The map ¢: C* — R*® by 


o(z) = |z| is a homomorphism (see Exercise 1 at the end of this section), since 


b( 2122) = |z122| = |Z1||Z2| = O(21) (22). 


. Consider an infinite cyclic group G = (g) (i.e. the order of g is infinite) and 


the additive group Z. The map 6: G > Z by o(g”") = n is a homomorphism, 
since 


o(gg") = og") =m+n= $(g™) + o(g”). 


. For any abelian group G and integer n, the map 6: G > G by $(g) = g” is a 


homomorphism (left as an exercise). 


. For any group G, the identity map 1g: G > G by le(g) = 4g for allg € G is 


a homomorphism. 


. For any two groups G and G' with identities 1 and 1’ respectively, the trivial 


map 6: G-—> G" by d(g) =1' for all g € G is a a homomorphism. 


Lemma 2.6 Let G, G’ and G" be groups and ¢: G > G’, w: G! = G" be group 
homomorphisms. Then 


1. 


am &w~ GW ® 


el) = 1. 

. For allg € G we have ¢(g~') = o(g). 

. The composition wood: G— G" is a group homomorphism. 

_ If H <G, then ¢(H) < G’, i.e. the image of a subgroup is a subgroup. 

. If H' < G”, then @1(H') < G, ie. the inverse image of a subgroup is a 
subgroup. 

. If G is cyclic, then so is o(G). 

. If G is abelian, then so is $(G). 


Proof 2.13 We shall prove the first two statements and leave the rest as exercises. 


To see the first statement is true, notice that 


o(1) = (1-1) = 60) 60), 


and so multiplying both sides by $(1)~! yields 1’ = (1). 
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For the second statement, notice that 


o(9-")o(9) = og *9) = 001) = 1, 


and likewise 6(g)¢(g-') = 1’. Thus, ¢(g~!) is the inverse of ¢(g) which is exactly 
what we wish to show. 


We now introduce some terminology regarding homomorphisms which shall be 
used in the remainder of the text. 


Definition 2.16 A group homomorphism ¢: G— G' is called 
1. amonomorphism if ¢ is one-to-one. 
2. an epimorphism if ¢@ maps onto G’. 


3. an isomorphism if ¢ is both one-to-one and maps onto G’. 


Definition 2.17 A homomorphism from a group into itself is called an endomor- 
phism and an isomorphism in this case is called an automorphism. A monomor- 
phism is sometimes referred to as an embedding and we say such a map embeds 
a copy of G in G’. 


Example 2.22 An important class of automorphisms are called inner automor- 
phisms and are defined as follows: Let g € G a group and define the map ig: G—+ G 


by ig(a) = gag! which is an automorphism, since 
ig(ab) = g(ab)g* = gag‘ gbg"* = ig(a)ig(b). 


We now define an important substructure associated with homomorphism. 


Definition 2.18 Let ¢ : G > G’ be a group homomorphism. The kernel of ¢, 
written 


kerg={g EG : o(9) =1'}. 


Figure 2.11 represents a generic picture of a homomorphism ¢ from a group (G, 1) 
to another group (G’, 1’) and the associated structures just introduced. 


Example 2.23 Consider the homomorphisms already presented in Example 2.21. 
Bear in mind it is important to keep track of what the identity element is in the 
codomain in order to compute the kernel of a homomorphism. 


1. Recall 6: R*° > R by ¢(a) = Ina. A positive real number 


a€kero iff d(a)=0 iff Ina=0 iff a=1. 
Therefore, kerd = {1}. 
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Figure 2.11 Visual representation of a homomorphism and its associated structures. 


2. Recall ¢: GL, (R) > R* by ¢(A) =|Al. An invertible matrix 
Aékerd iff oA) =1 if |A|=1. 
Therefore, ker@ = SL,(R). 
3. Recalld: Z> Z/ =n by O(k) = [k]n. An integer 
kéekerd iff o(k) =([0)n if [Kn =[Oln tf K=n0 iff nk. 
Therefore, ker@ = nZ. 
4. Recall ¢: C* 4 R*° by $(z) = |z|. A complex number 
zé€kerd iff o(z)=1 iff |z|=1. 
Therefore, geometrically, the kernel of ¢ is the unit circle in the complex plane. 
Lemma 2.7 Let ¢: G— G' be a homomorphism. Then 
1. 1 € ker. 
2. kerdO<G. 
3. dis a monomorphism iff kerd = {1} (written more simply kero = 1) 
In which case we say the kernel is trivial. 


Proof 2.14 The first statement follows immediately from Lemma 2.6.1 and the sec- 
ond statement is left as an exercise. To prove the third statement first assume that 
@ is one-to-one and take any k € kerd. Thus, o(k) = 1’. By the first part, we know 
that o(1) = 1’ as well. Therefore, k = 1 and the kernel is trivial. Now assume that 
the kernel is trivial and suppose $(g) = ¢(h). Then 6(g)¢(h)~+ = 1! and, using prop- 
erties of a homomorphism, ¢(gh~') = 1'. Since the kernel is trivial it must be that 
gh-1 =1 and so g=h. 
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Example 2.24 Consider again the homomorphisms already presented in Exam- 
ple 2.21. The homomorphism ¢ : R*° + R by ¢(a) = Ina is one-to-one for we 
found that the kernel of @ was trivial. The remaining three homomorphisms have 
non-trivial kernels and therefore are not one-to-one. 


Definition 2.19 Let G be a group. 
1. The automorphism group of G, written 


Aut(G) ={¢@:G—>G : ¢ is an automorphism}. 


2. The inner automorphism group of G, written 
ImiG)={ig2GoG =: ge}. 


Proposition 2.4 Let G be a group. Then 
1. Inn(G) < Aut(G) < Sym(G) (where the operation is composition) 
2. The map tv: G — Inn(G) by i(g) = ig is an epimorphism. 


Proof 2.15 To prove Aut(G) < Sym(A), by earlier work we know that if ¢,w € 
Aut(G), then so is w—' € Aut(G) and thus so is dow! € Aut(G). To show Inn(G) < 
Aut(G) requires a bit more work. We show for any g,h € G that ig” = ig-1 and 
tg Ot, = ign. First, for alla eG, 


g 


ig(ig-1)(@) = igh(g™ Jag") "] = tg(9"'ag) = g(g"'ag)g~* = a. 
Similarly, ig-1(ig(a)) = a and so ig-1 is the inverse of ig as required. Second, for all 
aeG, 

ig(in(a)) = ig(hah~') = g(hah~")g~* = (gh)a(gh)~" = igh(a). 
Hence, the result has been proved, since for all g,h € G we have igi;, | = ign-1 € 
Inn(G). 

The map v certainly maps onto Inn(G) by its very definition. To show it is a 

homomorphism, notice that ugh) = u(g)i(h) since we have shown that igh = igin- 


We will now prove Cayley’s Theorem which states that any group G can be 
embedded in a symmetric group, namely Sym(G). 

For each g € G consider the map A, : G > G by A,(a) = ga. It is easy to check 
(see exercise) that each A, is a bijection and hence an element of Sym(G). The Greek 
letter A stands for left multiplication by an element of G. 


Example 2.25 Consider the multiplication table derived earlier for Z4 (see Fig- 
ure 2.7 of Section 2.1). Each row of the body of the table corresponds to the outputs 
of the functions we just defined, namely r1,Ag,An and Ax. For instance, 
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Proposition 2.5 (Cayley’s Theorem) A group G can be embedded in Sym(G) 
and thus every group is a subgroup of a permutation group. 


Proof 2.16 Consider the map 1: G — Sym(G) by X(g) = Ag. To show XA is a 
homomorphism we need to show for allg,h € G that \(gh) = A(g)A(h) or equivalently 
Agh = AgAn- But this is true, since for alla € G, 


Agh(@) = (gh)a = glha) = Ag(ha) = Ag(An(@)). 


Finally, X is a monomorphism, since its kernel is trivial. Indeed, if g € kerA, 
then A(g) = 1a, the identity map, so for alla € G we have X4(a) = a. In particular, 
Ag(1) = 1 or equivalently g = 1. 


EXERCISES 


1 For any real complex number z = a+ bi € C we define the magnitude of z, 
written |z| = Va? + 67. As a vector (a,b) in the complex plane,|z| is simply the 
length of that vector. Prove that for any 2, 22 € C we have |z129| = |z1||z2I. 


2 Let @ € Aut(G). For each of the following subgroups of H < G, show that 
o(H) CH 


a. H={geG : ga=ag, Va€c G}, the center of G. 


b. H is the only subgroup of order n for some fixed value n. 


3 Prove that for any abelian group G and integer n, the map ¢: G — G by 
0(g) = g” is a homomorphism. 


4 Prove that for any group G, the identity map 1g: G > G by le(g) = g for all 
g € Gis a homomorphism. 


5 Prove that for any two groups G and G’ with identities 1 and 1’ respectively, 
the trivial map ¢: G > G’ by ¢(g) = 1’ for all g € Gis a homomorphism. 


6 Prove the remaining parts of Lemma 2.6. 
7 Prove Lemma 2.7.2. 


8 For each g € G consider the map A, : G > G by A,(a) = ga. Prove that Ag is 
a bijection. 


9 As we did in Example 2.25 express all the Ag maps as permutations with their 
corresponding cyclic representation for the Klein-4 group. 


10 Consider the element 7 € Qs. 


a. Via Cayley’s Theorem identify 7 € Qg with a permutation, then express this 
permutation as a product of disjoint cycles and determine its order. 


b. Using your work in part a, express the permutation as a product of trans- 
positions and decide whether it is an even or odd permutation. 
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2.7 ISOMORPHIC GROUPS 


We now look at isomorphisms more closely and the notion of isomorphic groups which 
is a way of identifying two groups as essentially equal. To begin, we note that certain 
properties of the domain of an isomorphism are carried over and preserved in the 
codomain. 


Lemma 2.8 Let 6: G — G" be a group isomorphism. 
1. If G is abelian, then so is G’. 
2. If G is cyclic, then so is G’. 
3. Ifg € G has order k, then so does (gq). 


Proof 2.17 We have seen already in Lemma 2.6 that the image of a homomorphism 
is abelian (or cyclic) if the domain is abelian (or cyclic) and since in our case @ maps 
onto G’ the first two statements hold. To prove the third statement, first note that 


o(9)* = o(9*) = 0) = 1. 


Second, if for some positive integer n we have o(g)" = I’, then o(g”) = 1’. Thus, 
g” € ker@ which is assumed to be trivial. Therefore, g” = 1. Since o(g) = k it must 
be thatk <n. 


Definition 2.20 Two groups G and G’ are said to be isomorphic, written G = G’, 
if there exists an isomorphism 6: G — G’. 


Example 2.26 We have seen some of the content of these examples earlier in the 
tert. 


1. Recall the map ¢: R*° > R by ¢(a) = Ina which we have already shown is a 
monomorphism. Furthermore, @ maps onto R, since for any real number y the 


positive real number e¥ is mapped by @ onto y, i.e. d(e”) = IneY = y. Therefore, 
R°°SR. 


2. Consider the following set of matrices: 


b 
G= a a | : a,b€R, but not both equalling cero ; 
We leave it as an exercise to show that G together with matrix multiplication 
forms a group. We show that this group is isomorphic to the non-zero complex 
numbers under multiplication. To do this we have to come up with a map from 


C* to G which is an isomorphism. This can be quite an creative process, but in 
this case the map is easy to find. Define ¢: C* > G by 


(at bi) = a ak 
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Now we need to check three things: That $ is indeed a homomorphism, that 
ker@ is trivial and that @ maps onto G. This we leave as an exercise. 


3. A messy but true fact is that if A is a set of size n, then Sym(A) = Sp. 


Remark 2.9 Isomorphic groups are basically equal in the sense that they share all 
the same group properties and have the same multiplication table — the names of 
the elements in each group might be different and the operations might be different, 
but the way in which the operation relates to the group elements are identical for 
both groups. More formally, isomorphism is an equivalence relation on the collection 
of groups and each equivalence class of this relation consists of the subcollection of 
groups which have been identified as isomorphic. 


Therefore, an easy way to determine when two groups are not isomorphic is to 
exhibit a particular group property they do not share. We summarize some of these 
properties in a lemma the proof of which follows immediately. 


Lemma 2.9 Let G and G’ be two groups. 
1. If one group is abelian and the other is not, then G FG’. 
2. If one group is cyclic and the other is not, then G ¥ G’. 
3. If |G| 4 |G'|, then G FG’. 
4. If one group has an element of order k and the other does not, then GF G’. 
5. If G has n elements of order k and G’ does not, then G & G’. 


Example 2.27 Consider some of the groups we have encountered up to this point in 
the teat. 


1. The groups Z4 and the Klien-4 group are not isomorphic, since Zq ts cyclic, but 
the Klien-4 group is not. 


2. The groups Ze and S3 are not isomorphic, since Ze is abelian, but S3 is not. 
3. Ze $ Zs, since |Ze| =6 f a= |Zs|. 


4. The Dihedral Group, D4, is not isomorphic to the Quaternions, Qs, since D4 
has elements po, P1, P2; P3> H1, 12, 13, bla Of orders 1,4, 2,4, 2,2,2,2 (respectively) 
and Qs has elements 1,—1,i, -i, 7, —j,k, —k of orders 1,2,4,4,4,4,4,4 (respec- 
tively). So for instance, D4 has five elements of order 2 while Qg has only one. 


One important aspect of abstract algebra is the notion of classification. We 
consider groups sharing a certain property and try to discover the complete list 
of specific groups which have that property — more precisely we list the distinct 
equivalence classes under the relation isomorphism. Therefore if P is some property, 
then the classification of groups with property P will consist of all the distinct non- 
isomorphic groups which have property P (ones says distinct up to isomorphism). 
We will do this now for cyclic groups. 
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Theorem 2.6 Consider a cyclic group G. 
1. If G is a finite cyclic group of order n, then G = Zn. 


2. If G is an infinite cyclic group, then G = Z. 


Proof 2.18 For the first statement we shall show that any two cyclic groups of order 
n are isomorphic and thus by transitivity of the equivalence relation isomorphism the 
result follows. Let G = (g) and G’ = (g') be two cyclic groups of order n. Define the 
map ¢: G—> G' by o(g*) = (g')*. Since elements of G can be represented in multiple 
ways as a power of g we need to check that the map is well-defined. In other words, 
if g§ = g™, then o(g") = (g™). To see this, since g* = g™, then g”—* = 1 and son 
divides m—k. Write m—k=nr for some integer r. Note that 


Therefore, (g’)* = (g')™ and so ¢(g*) = o(g™). Now we show that ¢ is a homo- 
morphism. 


o(g*g™) = o(g't™) = (g/)**™ = (g')*(g')™ = 0(9") O(9”)- 


The kernel of this map is trivial, since if o(g™) = 1’, then (g')™ = 1 and so 
n divides m. Write m = nk. Then g™ = g™ = (g”)\* = 1* = 1. Finally, @ maps 
onto G’, since for any (g')™ € G" the element g™ maps onto (g')™. Hence, ¢ is an 
isomorphism and thus G & G’. 

Now suppose that G = (g) is an infinite cyclic group. We leave as an exercise the 
fact that the map 6: G > Z by ¢(g") = n is a homomorphism. Since g has infinite 
order, the kernel of @ is trivial and the fact that 6 maps onto Z is immediate from 


the definition of o. 


Theorem 2.6 says there is only one cyclic group — up to isomorphism — of a given 
cardinality. Therefore, all cyclic groups have been classified. 


Example 2.28 Let’s apply this classification result to some concrete examples. 


1. Consider the following three cyclic groups of order four: Z4, Zz and the sub- 
group of D4 consisting of all the rotations of the square (generated by the 90° 
rotation). By the theorem just proved we know that all three are isomorphic. 


2. The cyclic groups Z/ =, and Z, are isomorphic, since they have the same 
order. 


3. Consider the infinite cyclic subgroup of IR* defined by H = {x : ne Zh. By 
the second part of the theorem just proved we know that H = Z. 


A fact about finite abelian groups which is useful to note and which we shall 
prove later (its proof requires theory we do not yet have) is the following: 
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Theorem 2.7 (Classification of Finite Abelian Groups I) If G is a_ non- 
trivial finite abelian group, then G is isomorphic to a direct sum of non-trivial cyclic 
subgroups of prime power order. Furthermore, this direct sum representation is unique 
up to isomorphism and order. 


Example 2.29 Let’s apply Theorem 2.7 to a couple of examples. 


1. The abelian groups of order 8 are Zg, Za ® Zz and Zz © Zo © Zo, since these 
are the only groups (up to isomorphism) of order 8 which are direct sums of 
cyclic groups which have orders a power of 2. Therefore, there are five groups 
(up to isomorphism) of order 8 — the three just mentioned as well as D4 and 
Qs (one really should check there are no other non-abelian groups of order 8, 
but this is non-trivial). Thus, we have classified groups of order eight (give or 
take a few verifications). 


2. Consider groups of order 72 = 2°37. Considering that subgroups of such a group 
can only have orders 2,27, 2?,3,3? the only abelian groups (up to isomorphism) 
of order 72 are 


ZgPLZy, LZOL2Z3023, Z4G2,.0L25, 2402.0 23 0Z3. 


Ly BLZy06LZ20L29, LZo62,62, 823 G Z3. 


Lemma 2.10 The following are equivalent for G an abelian group with subgroups 
MM, Ho,..., Hn andG=H,+ Ho+-:-+ An: 


1.G=H,0A28::-@ Ay 


2. For every g € G there exist unique hy € Hy, ho € Ao, ..., hyn € Hn such that 
ga=hyt+het+--++hn. 


3. GSH, x Ay x.--- x Ay via the map (hy + ho +--+ + hn) (hy, ha,..., An). 


Proof 2.19 In Lemma 2.5, we proved the first two statements are equivalent. Now 
we show that the second statement is equivalent to the third statement. Assuming 
the second statement, it is straightforward to show that the map given in the third 
statement is an isomorphism. In fact, the uniqueness in statement two insures that 
the map is well-defined. 

Assuming now that the map in the third statement is an isomorphism we show 
the second statement holds. Again existence of a representation follows, since G = 
Ay, + Ho +---+ Hy. Finally, we show unique representation. Suppose there is a 
g€ Gand hy, € Ay, ho € Ao, ..., hn € Hy such that g = hy +ho+---+hy and 
ky € My, ko © Ao, ..., ky © Hy such that g = ky + kg +---+ ky. Since the map 
is well-defined, it must be that hy + hg +---+hyn and kj + kg +---+ky are sent to 
the same place, 4.€:-(hi, Wip0005 My) = CKiykagns phy) ond Sohy — hy he Shay deny 
hyc= Kx. 
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EXERCISES 


1 Consider the following set of matrices: 


G= . : | : a,b €R, but not both equalling wero} ‘ 


a. Show that G together with matrix multiplication forms a group. 


b. Define ¢: C* > G by 


Verify that ¢ is an isomorphism and thus conclude that G = C*. 
2 Prove that each of the following pairs of groups are isomorphic: 


a. (Zy6,+16) and (Zjz, +17) 


b. : | :neZ with matrix multiplication and (Z, +). 


c. Qs and the following matrix group G: Let 


0 1 0 2 
a=| 4 and B=|¢ al 


Set G = (A, B), the subgroup of M22(C) (under matrix multiplication) 
generated by A and B. 


d. The two subgroups listed in Exercise 2 for Section 2.2. 
3 Prove that if A is a set of size n, then Sym(A) = S;, 


4 Prove that the notion of isomorphic groups forms an equivalence relation on the 
collection of groups. 


5 Suppose that G = (g) is an infinite cyclic group and define the map ¢: G > Z 
by o(g”) =n. Prove that ¢ is a homomorphism. 


2.8 COSETS OF A GROUP 


We now step up the level of complexity a bit by defining a very important equivalence 
relation on a group. This results ultimately in a new group whose elements consist of 
the equivalence classes of this special equivalence relation. This is one of the reasons 
why equivalence classes were gone over so carefully in the first chapter — to make the 
understanding of this chapter easier. 
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Figure 2.12 The cosets of an arbitrary group G. 


Consider a subgroup H of a group G. This equivalence relation will be dependent 
on our choice of H and is defined as follows: For gi, g2 € G, 


g1~ 92 iff 9,92 € H. 


The fact that this relation is an equivalence relation is strongly contingent on the 
fact that H is a subgroup of G. Indeed, to show ~ is reflexive, that g ~ g relies on 
the fact that 1 € H. To show symmetry, we need the fact that H is closed under 
inverse. To show transitivity, we need the fact that H has the closure property. We 
leave it to the reader to verify these three properties. 

Now let’s look at the equivalence classes formed by this equivalence relation. For 
GEG, 

Il={ €G i gnrg}={ eG : g'g € A} 
={g €G: 9 ‘9 =h forsome fe A} 
={g'€G: g =gh forsome he H} 
={gh : forall he H}. 


So the class of g consists of g times every element in H. We shall employ the 
product notation gH for this class and it shall be called a left coset of G with 
respect to or modulo H. In a similar manner one can define right cosets. 

Since cosets are equivalence classes we know by our work in Chapter 1 that they 
partition G. In other words, G is a union of left cosets and any two cosets are either 
disjoint or coincide (Figure 2.12). 


Example 2.30 We construct the cosets of a group in some specific settings. 


1. Consider the multiplicative group G = Z= and the subgroup H = {1,6}. The 
subgroup H which is the left coset 1H is one left coset. In order to form a 
new coset choose an element of G not in 1H, say 2 (otherwise we will get H 
again — this is a property of equivalence classes which we saw in Lemma 1.1 of 
Section 1.1). Then 2H = {2,5} is a second left coset. Now choose an element 
in G not in the first two cosets already formed, say 3. Then 3H = {3,4}. At 
this point we have exhausted all the elements in G and so we know these three 
left cosets are the complete list of unique cosets of G modulo H. 
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Figure 2.13 The cosets of Dy modulo (11). 


2. Consider the additive group G = Z and the subgroup 3Z = {0,+3,+6,...}. 
We will use additive notation for these cosets. The first coset is the sub- 
group 3Z = 0+ 3Z. Choose 1 ¢ 3Z and compute the second coset 1 + 3Z = 
{...,—5,-2,1,4,...}. Choose 2 which is not in the first two cosets already 
formed and compute 2+ 3Z = {...,—4,—1,2,5,...}. At this point we have 
exhausted Z and so these three cosets are the complete list of cosets modulo 3Z. 
Notice that we obtained the same three cosets when we computed the equivalence 
classes of Z with the equivalence relation congruence modulo 3. Indeed, 


3Z = [0]3 1+3Z = [l]s 2+ 3Z = [2Js. 


It is precisely because of this connection to congruence modulo n that the cosets 
are referred to as cosets modulo H. 


3. Consider the group D4 and the subgroup (111) = {po, Hi}. Once again the sub- 
group itself, (111) = po(pl1) is one coset. Since pz is not in the subgroup we form 
the second coset 


H(t) = {H2po, Hap} = {He2,( 12 (34 (24 )} 


= {u2,(1234)} = {p2, pr}. 


Now choose an element in D4 not in the first two cosets already formed, say 
pz. Then the third coset is 


L3({1) = {13Po; Mbt = {L3, po}. 


Finally, choose an element in D4 not in the first three cosets already formed, 
say fg. Then the third coset ts 


a(i1) = {Hapo, Matti} = {pa p3}- 


At this point we have exhausted all the elements of D4 and have the complete 
list of cosets of D4 modulo (,11) (see Figure 2.13). 
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Here are some useful properties of the left cosets of G modulo H which are easy to 
verify: 
1. The subgroup # is always one of the cosets. 


2. For any g1,92 € G we have g,H = gH iff go € g,H. In other words, if you 
choose an element of a coset and form its coset you will get the same one. Any 
other choice will get you a new one. 


3. For any g eG, ge AH iff gH =H. 


Definition 2.21 Let H < G. We denote the collection of cosets of G modulo H by 
the notation G/H and the number of distinct cosets of G modulo H by |G: H] which 
is called the index of H in G. 


Example 2.31 For the following examples the reader should refer to Ecample 2.30. 
1. Z3/A = {H,2H,3H} and |Z? : A] =3. 


2. Dj 3h= {3Z, 1+ 3Z,2+3Z} and (|Z: 3Z] 5a 
3. Da/(p1) = {(H1); Ha(ea), M3 (Hi); Ha(ua) and [Dag : (p11)] = 4. 


You may have noticed in our examples that cosets of a given group always had 
the same size. This observation hints at a theorem which is attributed to the Italian 
mathematician Joseph-Louis Lagrange and is a fundamental result in group theory. 


Theorem 2.8 (Lagrange’s Theorem) For any finite group G with subgroup H we 
have |G| =[G: H]|A. 


Proof 2.20 First note that any coset of G modulo H has the same size as H (and 
thus all cosets have the same size). To see this take any g € G and consider the map 
f : H > gH by f(h) = gh. It is easily seen that this map is a bijection between 
the two sets and hence they have the same size. Now since G is finite and is the 
union of its cosets, it must be the case that the number of cosets |G : H]| is finite. Set 
n=[G: H] and write G as a disjoint union of cosets, t.e. G = g, HUgHU-:-UgnH. 
Then 


IG] = |g. + |goH| +--+ + gn] = |A| + |H| +--+ |H] = nlf] =([G: AA. 
SS 


n 


Example 2.32 We give some simple illustrations of Lagrange’s Theorem (refer to 
Example 2.30). 


1. |Z3| = |Z? : H]|H|, since 6 = 3-2. 
2. [Dal = [Da : (un)]lKu)], since 8 = 4-2. 


One can already see the importance of this theorem by the results listed below 
which immediately follow from it. 
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Corollary 2.2 If G be a finite group, then 
1. The order of any subgroup H < G divides the order of the group G. 
The order of any element g € G divides the order of G. 
For any element g € G we have g'@ = 1. 
For any prime p and any integer n 4 0 we have nP-! = 1(mod p). 


For any chain of subgroups K < H<G we have |G: K] =[G: H][H: kK]. 


SP GM & Sf & 


If the order of G is prime, then G is cyclic. 


Proof 2.21 The first statement is an immediate consequence of Lagrange’s Theorem 
which says that |G| is an integer multiple of |H|. The second statement then follows 
from the first, since o(g) equals the order of the subgroup it generates, t.e. o(g) = |(g)|. 
The third statement follows from the second, since we can express |G| = o(g)k for 


some k € Z and so 
gil = gear = (9°) ee 


To see the fourth statement consider the multiplicative group Z, and the result 
follows from the third statement. For the fifth statement we apply Lagrange’s Theorem 
on the following three subgroup relations: H<G, K < H and K <G to get 


IG| = |G: Hl] A| |H| = [H: K]|K| IG| = [G: K]|K]. 
Putting these three equations together we have 
IG: B||K| = |G) = (4: Al |A|=(G 2 A]|A : kK) |K\. 


Cancelling |K| from both sides of the above equation yields the result. For the 
sitth statement, since the order of the group is a prime, p say, we can take ag #1 
in the group. Now since o(g) 4 1 and we know o(g) divides the order of the group, 
it follows that o(g) equals p and so the cyclic subgroup (g) has the same size as the 
entire group, thus making the group cyclic. 


Remark 2.10 The last statement in Corollary 2.2 leads to an important classifica- 
tion result. It says that there is only one finite group (up to isomorphism) of any 
given prime order and that group must be cyclic. Hence, we have completely classified 
groups of prime order. 


EXERCISES 


1 Verify that the relation defined at the beginning of this section and which pro- 
duced cosets as equivalence classes is indeed an equivalence relation. 


2 Verify the following properties of the left cosets of G modulo H: 


a. The subgroup H is always one of the cosets. 
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b. For any gi, g2 € G we have g,H = goH iff go € gH. 
c. Foranyg€G, ge A iff gH =H. 


3 For each group G and subgroup H, compute the cosets of G modulo H. 


a. Gis the Klein-4 group and H = {1, g} (refer to its multiplication table given 
in Figure 2.7). 


by G= $3 and A = (iy). 
é. G= Dy and A = (pa): 
d. G= Qs and H = {41}, 


4 For H <Gandgé€G, verify that f : H > gH by f(h) = gh is a bijection. 


2.9 FACTOR GROUPS AND NORMAL SUBGROUPS 


We return now to the cosets we constructed in Section 2.8. Our goal is to take this 
cosets and make them the elements of a new group. We will only be able to do this 
if we add a condition on the subgroup under which we are making these cosets (to 
be presented shortly). There are many good reasons to want to construct this group 
of cosets. One reason among many is that since the group of cosets will have smaller 
size than the original group we can thus utilize this group when we are proving facts 
by induction on the size of a group. Another reason is that the group of cosets can 
sometimes give us insight into other groups to which it might be isomorphic. 

Recall that if H < G, then G/H = {gH : g ©€ G}, the set of cosets. The most 
natural way to define an operation on cosets would be to multiply the representatives 
using the original group operation, i.e. (g1H)(g2H) = (g1g2)H. This seems to work 
quite effectively, since it easily satisfies the four axioms of a group. Indeed, this follows 
since the original group satisfies the four axioms: 


1. The product of two cosets is another coset. 
2. The identity coset is 1H or simply H, since (gH)(1H) = (g1)H = gH. 
3. (gH)-1 = 91H, since (gH)(g- 1H) = (gg 1)H =1H. 


4. Associativity follows, since the representatives associate: 


(91H) (g2H)](93H) = (9192) H (93H) = [(g192) 93] 
= [91(9293) | = (91) (9293) = (917) |(92)(934)]. 


The reader may be therefore thinking what is the trouble with always defining this 
operation on the cosets? The potential problem lies in the fact that, as equivalence 
classes, cosets have multiple representations and since our operation relies on the 
representation, we want to be sure that it is not dependent on it. More formally, this 
operation can be viewed as a potential function from G/H x G/H to G/H and we 
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need to be sure that this function is well-defined. In other words, if g;H = gH and 
g2H = 95H, then we want to guarantee that (g1H)(g2H) = (9, H)(95H). 

Let’s first explore an example where things go terribly wrong in order to motivate 
the need for a necessary and sufficient condition for the coset operation to be well- 


defined. 


Example 2.33 Consider the group G = S3 and the subgroup H = (t1) = {1, pr}. 
One can check that G/H = {H, p2H, w3H} where w2H = {h2, po} = poH and 
13H = {u3, pi} = pH. Notice that 


(2H) (W3H) = (HoM3)H = pH while (p2H)(p1H) = (p2p1)H = 1H = HX. 


So we see in this case that the resulting coset produced by the product of two cosets 
is dependent on the way in which we represent them. Hence, these cosets cannot form 
a group using the definition of multiplication of cosets which was just defined. 


We now explore this coset multiplication more carefully in order to extract a 
necessary and sufficient condition on H for it to be well-defined. Now since g, H = gH 
and goH = g5H, this implies gi € g,H and g) € goH and so gi = gyhy and gh = goh2 
for some hy,h2 € H. In order to have (g1H)(g2H) = (9,H)(g5H) or equivalently 
(9192) H = (9\.95)H, we need gig € (9192) or equivalently gj. = gigeh for some 
h € H. By substitution, we need gihig2h2 = gigeh or hygoh2 = goh or go hige = 
hhz' € H. So it would appear that the condition we need for the coset operation to 
be well-defined is the following property which we introduce formally. 


Definition 2.22 A subgroup H of a group G is normal, written H <G, if for all 
g€Gandhe H, we have g-'hg € H. 


Example 2.34 Consider the group 


o={] 4 ed : a,beE Zs, zo} 


ond the selgroup H= (1 4 i : ce tah. 


We demonstrate that HAG. This follows, since 


ee lewleeels clloa|ee| 


_ | at ate—b a b oy eikee (ae ee 
~|§ a NE (3 1 Jew 


We now prove the necessary and sufficient condition for coset multiplication to 
be well-defined. A majority of the proof comes from the exploration we just did. 
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Figure 2.14 The quotient group multiplication table for Examples 2.35.1. 


Theorem 2.9 Let H be a subgroup of G. The set of cosets under the operation 
(1H) (92H) = (gig2)H forms a group iff HG. 


Proof 2.22 First assume that HAG. As we have seen above it is enough to show that 
the coset operation is well-defined for the cosets to form a group. Therefore, suppose 
that 9, H = gH and goH = gH. Then gi € gH and gb © goH and so gi = gihi 
and gi = gghz for some hy,hyg € H. Since HAG we know that gyhigo € H and 
80 9a hige = h for some h © H. Thus, higg = goh and so gqhygoho = gigehha 
or equivalently gig, = gigohh2 € giggH. Therefore, gigsH = gig2H or equivalently 
(9,4) (92H) = (91H) (92H). 

Now assume that the cosets form a group under the coset operation. Take g © G 
and h € H. Then 


g thg = g thgl — g (\HgH = (g‘g)H —-1H—dH. 
Therefore, H AG. 


Remark 2.11 We point out that in an abelian group the problem of well-definition of 
the coset operation is not an issue, since in an abelian group every subgroup ts normal. 
Indeed, if H < G abelian, g€ G andh € H, we have g''hg=hg-'g=heH. 


Definition 2.23 Suppose the H a G. The set of cosets under the coset operation 
forms a group called the factor group or quotient group of G with respect to H. 


Example 2.35 We present several examples of factor groups. 


1. Let G = Zé with multiplication modulo 7 and H = {1,6}. Since G is abelian 
we know that H a Gand so we may consider the factor group G/H. The cosets 
for this factor group are H, 2H = {2,5} and 3H = {3,4}. The multiplication 
group for this factor group is presented in Figure 2.14. Since there is only one 
group (up to isomorphism) of order three, it follows that G/H = Zs. 


a= {| 5 a : a,b € Zs, a0] 


2. Consider the group 
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Figure 2.15 The quotient group multiplication table for Examples 2.35.2. 


ond the subgroup HT = 4| § | : ce tah. 


Now G is not abelian, so one needs to first check that H is normal in G, however 
in Example 2.84 we proved this very thing. We now construct the elements of 
the factor group G/H. Note that |G/H| = |G: H] = |G|/|H| = (4)(5)/5 = 4, 
so there will be four cosets in this factor group. There is, of course the identity 
coset H. The other cosets are 


For brevity, set 


_f20 [30 i242? 
BW ys age fe ed ie ogy |, EE og dic 


We leave it to the reader to verify the multiplication table for this group pre- 
sented in Figure 2.15. From our knowledge of the multiplcation tables for groups 
of order four it is evident from the table that G/H = Z4. 


&. Let G=Z with addition and consider the subgroup H = 3Z. Now 3Z<Z, since 
G is abelian. We have already computed 


Z,/3Z, = {3Z, 1 + 3Z, 2 + 3Z}, 


and the coset operation is addition of representatives used in the underlying 
group Z. The addition table is given in Figure 2.16. Once again, we have a group 
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3Z 


1+ 3Z 


Figure 2.16 The quotient group multiplication table for Examples 2.35.3. 


of order three and so we know Z/3Z ~ Z3, the only group (up to isomorphism) 
of order three. But, in fact, in this case the identification of the groups is much 
more apparent, since coset addition of representatives is exactly addition modulo 
three. More generally, we have the identification of three representations of the 
same group, namely, for any positive integer n, 


The next result is important in group theory and has analogous results when 
studying other algebraic structures. It makes a strong connection between groups 
and factor groups. 


Theorem 2.10 (Fundamental Theorem of Group Homomorphisms) Let ¢: 
G — G’ be a homomorphism and set K = ker ¢. Then G/K = ¢(G). Furthermore, 
if @ is an epimorphism, then G/K & G'. 


Proof 2.23 The proof is straightforward and direct. We simply produce the map 
which makes the two groups isomorphic. Define UV: G/K — $(G) by U(gk) = ¢(g). 
First of all, the following shows that V is both well-defined and one-to-one: 


gK =hK & gh eK & O(gh)=1' & dg)d(h)l=V & O(g) = GA). 
The map V is a homomorphism, since 
W(gKhK) = U(ghk) = o(gh) = $(9)o(h) = Ugh) Uh). 


The fact that V maps onto ¢(G) is immediate from its definition, and certainly 
if @ is an epimorphism, then ¢(G) = G' and so G/K =G". 
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Figure 2.17 The map ¢ for Example 2.36.3. 


Example 2.36 We illustrate Theorem 2.10 with several examples. 


1. Recall the epimorphism ¢ : GL,(R) > R* by ¢(A) = |A| for which we com- 
puted ker@ = SL,(R). Since 6 maps onto R* we know $(GL,(R)) = R*. 
Therefore, by the Fundamental Theorem of Group Homomorphisms (FTH), we 
have GL,,(R)/SL,(R) = R*. This isomorphism turns out to be a useful result 
for counting the size of SL,(F’) (over a different finite set of scalars F'}) and 
this we do later on in the tect. 


2. Recall the epimorphism @: Z— Z/ =, by o(k) = [k]n for which we computed 
ker 6 = nZ. By FTH, we have once again that Z/nZ = Z/ =n. 


3. Recall the epimorphism o : C* + R*° by 6(z) = |z| for which we computed 
ker d = S', the unit circle in the complex plane. By FTH, we have C*/S! = 
R>°. In this case the identification of the two groups as being isomorphic is 
intuitive in the sense that the factor group has cosets which consist of complex 
numbers of the same length and visually what is happening is that the punctured 
(without the origin) complex plane is being collapsed onto the positive real axis 
circle-by-circle, i.e. for all real numbers r > 0, all the points on a circle of 
radius r in the complex plane are sent to r on the positive real axis (see Figure 


2.17). 


4. Recall the epimorphism uv: G + Inn(G) by (g) = ig where ig(a) = gag! for 
alla eG. Let’s find the kernel of this map. 


g€kerte © ug)=l1le © Foralla€e G, t(g)(a)=a 


© For alla € G, ig(a) =a & For alla€G, gag =a 


& Forallae G, ga=ag © gE Z(G). 
Hence, kert = Z(G) and by FTH we have G/Z(G) = Inn(G). 


Basic Group Theory @ 75 


/ lat \ \ 


P(g) 
Figure 2.18 A commutative diagram illustrating that Vov = @. 


5. Consider the identity map 1g and FTH to see that G/{1} = G and consider 
the trivial map (the map that sends every input to the identity) and FTH to see 
that G/G is isomorphic to the trivial group. 


Remark 2.12 We make some additional remarks about the Fundamental Theorem 
of Homomorphisms (ETH). 


1. When G is finite, |G| = |ker@||¢(G)]. 


2. For those who have taken a linear algebra course, there is a similar result for 
vector space dimension and a linear transformation T with domain a vector 


space V. 
dim(V) = dim(kerT) + dim(T(V)). 


But notice that the linear algebra result is the logarithmic equivalent of the group 
theory result. 


We now present another map which connects factor groups and groups and which 
shows up in the study of any algebraic structure. 


Definition 2.24 Let N<G a group. The mapv:G—> G/N by v(g) = gN is called 
the canonical (or natural) map. 


Recall we have already defined this map in a more general context when we 
introduced equivalence classes — it was a map which set every element in the set to 
its equivalence class. The map just defined is simply a specific example in which the 
set is a group and the equivalence classes are cosets. It is a simple matter to verify 
that the canonical map is an epimorphism and we leave it as an exercise. This map 
plays an underlying role in two important facts. The first is in FTH, for we can relate 
the maps V : G/K — ¢(G) and ¢: G > G’ in the proof as follows: Vov = ¢, and 
we have the picturesque commutative diagram in Figure 2.18. 

The second fact involving the canonical map is Theorem 2.11. 
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Theorem 2.11 A subgroup H of a group G is normal iff H is the kernel of some 
homomorphism with domain G. 


Proof 2.24 Assuming H <G, consider the canonical map v: G > G/H by v(g) = 
gH. Then the kernel of this map is H, since 


g€kerv & v(g)=H & gH=H © ged. 


If H is the kernel of some homomorphism, it is certainly normal, since the kernel 
of a homomorphism is always a normal subgroup (left as an exercise). 


2.9.1 Semidirect Products 


One way to generalize the direct product is the semidirect product which has internal 
and external types just as the direct product does. This topic is slightly off the beaten 
path, and may be skipped, however it is included for completeness. 


Definition 2.25 Let H <GandN<G with NOH =1. The internal semidirect 
product of N and H, written N x H, represents the subgroup NH < G. 


Remark 2.13 A couple of remarks are in order at this point. 


1. Since NAG, we have hN = Nh for allh € H, and so NH = HN, from which 
it follows that Nx H<G. 


2. We can describe explicitly the product of two elements in Nx H. If nyhy, nghe2 € 
N x H, then 
(n1h1)(n2h2) = [ny (hyngh7')|(hyhe). 


Notice that multiplication in the semidirect product is very close to the multi- 
plication in the direct product except that nz is conjugated by hy. 


Definition 2.26 A group G splits if there exists H < G and NAG such that 
G=N»mH. 


Example 2.37 These examples will explore the idea of a group splitting. 


1. Let G be the group of all invertible upper triangular matrices with matrix mul- 
tiplication. Let U be the subgroup of unipotent matrices and T be the subgroup 
of invertible diagonal matrices. Note that 


o-{[¢¥,]  aseneg, 
Be ee (CP 


We leave it to the reader as an exercise to verify that T <G, UaG, G=UT 
andU NT =1 and so G splits asG=U™T. 
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2. Dn, reflections and rotations on a regular n-gon, splits as Dy, = (p) x (4), where 
p=(12--: n) and p is any reflection. Now (p)< Dy, since it has index 2 in Dn 
(see Exercise 14). Since (p) consists of all the rotations, its intersection with 
(4) ts trivial. To show Dy, = (p)(), it is necessary to show that Dy, = (p, 1) 
and use that fact that pp = p-'. We leave it as an exercise for the reader to 
work out the details. 


3. It’s easy to check that S, = Ay ™ (1). 
4. The cyclic group Zyn does not split. 


5. The quaternions do not split. 


Remark 2.14 The semidirect product N x H induces a homomorphism ¢: H > 
Aut(N) definied by [¢(h)](n) = hnh7! (left as an exercise). This observation allows 


us to generalize the semidirect product and produce the external semidirect product. 


Definition 2.27 Let G and G" be groups and suppose we have a homomorphism 
@: G’ + Aut(G). The external semidirect product of G and G’ via ¢, written 
G x4 G" has as its underlying set the cartesian product G x G' with multiplication 
defined as follows: For 91,92 € G and gj, 95 € G’, 


(91, 94) (92,92) = (gilb(94)] (92), 91.92): 
Remark 2.15 The reader should verify the following remarks. 


1. It’s easy to check that Gx 4G"! is a group. The identity is (1,1') and (g,g/)~1 = 
(o(9)*(g-*), 9!) *). 


2. If @ is the trivial homomorphism, then G x4 G" is just the usual external direct 
product, namely G x G’. 


Example 2.38 Set G = (x) be a cyclic group of order 3 and G’ = (y) be a cyclic 
group of order 2. 


1. Consider the homomorphism ¢ : G' — Aut(G) defined by [6(y)|(x*) = a~* 
(which is an automorphism on G, since G is abelian). The semidirect product 
G x4 G’ in this case is not abelian. Indeed, 


(x, y)(2", y) = (x[¢(y)|(@), y?) = (47,1) while 


(27, y) (x,y) = (@*[O(y)](2), 9") = (@, 1). 
Since |G x4 G'| = |G||G’| = 6, it must be that G x4 G’ = S3. Indeed, to 
get S3 directly take N to be the subgroup generated by p, the 120° rotation of 
the equilateral triangle and H be the subgroup generated by any reflection wu. 
Then @: H —+» Aut(N) maps each element of H to the corresponding inner 
automorphism, i.e. [¢(1)|(p) = up = p-'. 
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2. Now let @: G’ — Aut(G) be the trivial homomorphism. As stated in the remarks 
above this semi-direct product G x4 G' is just the direct product G x G! = 
Z3 xX La = Ze the other possible group of order 6. 


The above construction can be generalized to G = Z, = G" for some positive 
integer n and one can show the two groups you get are D, and Zon. 


Remark 2.16 Just as in the case of abelian groups every internal direct sum of 
subgroups is isomorphic to the external direct product of the same subgroups, so too 
is it true in the case of semidirect products. Indeed, any external semidirect product 
of groups can be viewed as an internal direct product of isomorphic copies of these 
groups. 


We now prove the details of this remark. Suppose that G = N »4 H. Consider 
the embeddings (i.e. monomorphisms) a: N > G and 8: H > G defined by a(n) = 
(n, 1H) and B(h) = (1n,h). Set N’ = a(N) and H’ = B(#A). First, G = N’ x H’, 
an internal semi-direct product of N’ and H’. The only thing that really needs to be 
checked is the normality of N’ in G, but this is clear since 


(n,h)-*(n', Lt)(n, A) = ([6(h)] (2), A) (n"'[ O(n), Lh) 
= ([o(h)J-* (2), A) (n'n, b) = (P(A (2) [6(h™ )(n'n), Li) © 
Notice also that conjugating an (n, 177) in N’ by and element (1y,h) in H’ yields 
(1y, h)(n,1a)(1n, hk) = (An [6(h)] (2), hla) (OCA) in’), 2) 


= ([9(A)](n), hb), h™) = ([0(A)](m)[9(h)] Cy), Liz) = ([6(h)] (2), 1). 


Hence, conjugating N’ by H’ in the internal semidirect product induces the orig- 
inal automorphism which defined the external semidirect product. 


EXERCISES 


1 Consider the group D, and the subgroup H = {1, p}, where p is a 180° rotation 
of the square. Verify that H < D4. 


2 Verify the multiplication table given in Example 2.35.2 
3 For G = Qs and H = {+1, +7} what is the value of |G: H]? 


4 Consider the subgroup H = {+1}. 


a. Compute the elements of Qs/H and the value of [Qs : H]. 


b. Write out the multiplication table for Qg/H and decide to what group Qs/H 
is isomorphic? (justify) 


5 Let G = Zi; and H = (3). 


(ee) 
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a. Why do we know that H<«G? 

b. Compute the elements of G/H. 

c. Make the multiplication table for G/H. 
d. What group is G/H isomorphic to? 


Consider the group Z3 x Z3 (the operation is multiplication). 


a. Construct the lattice of subgroups. 
b. What well-known group must Z3 x Z3 be isomorphic to? (explain why) 


c. Consider the following group: 


a={| 5 ‘i 0c 25 be mah. 


Define the map ¢: G —> Z3 x Z3 by 


| =a. 


Check that ¢ is a homomorphism and then apply FTH in this setting. 


d. Let K = ker in part c. List the elements of G/K and write its lattice of 
subgroups. 


Consider the map ¢: Z x Z > Z by $(m,n) = m — 3n. 


a. Verify that ¢ is an epimorphism. 
b. Compute ker ¢. 
c. Express ker @ as a cyclic subgroup of Z x Z. 


d. Apply the Fundamental Theorem of Homomorphisms to obtain an isomor- 
phism between two groups. 


Consider the group Ag. 


a. List all cycle types, the number of each type and their orders. 


b. Let H = {1, (1 2)(3 4), (1 3)(2 4), (1 4)(2 3)}. What well-known group is 
H isomorphic to? (explain why) 


c. Form the elements of A,/H and create the multiplication table (you may 
assume that H < Ay). 


d. What well-known group is A4/H isomorphic to? (explain why) 
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10 
11 


12 


13 


14 


15 


Consider the following group: 


a. Compute the cyclic decomposition for each element of G. 


123 4 
b. Compute = ({ rae ) 


c. Compute G/H. 


Verify that the canonical map is a group epimorphism. 


Prove that for any homomorphism ¢: G > G’ that the kernel of ¢ is normal in 
G. 


Prove the following three statements are equivalent for H < G. 

a. HAG 

b. For all g € G we have g-'Hg = H. 

c. For all g € G we have gH = Hg. 

Prove that if H < Gand |G: H] = 2, then H dG using the following steps: 


a. If 91,92 € G, then gig € H (consider the cosets gj 'H and gH). 
b. Define the map ¢: G — {+1} by 


ike: “ah peS et 
=| 1, ifg¢H 


Prove by cases that ¢ is a homomorphism. 


c. Consider the kernel of ¢ to conclude that H dG. 
Prove that if H < Gand |G: H]| = 2, then H dG using the following steps: 


a. Define an equivalence relation on G which produces right cosets. You must 
verify that it is an equivalence relation. 

b. Prove in two cases that for all g € G that gH = Hg. 

c. Conclude that H dG. 

In Definition 2.4 we defined a particular equivalence relation on a group G. 


Prove that if H is a union of equivalence classes of this relation (i.e. a union of 
conjugacy classes), then H dG. 
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16 Referring to Example 2.37.1, verify that T< G, UG, G=UTandUNT=1 
and so G splitsasG =U xT. 


17 Work out the details of Example 2.37.2. 
18 Verify the following statements: 
a: Se = Ag (a): 
b. The cyclic group Zpn does not split. 
c. The quaternions do not split. 
19 Verify that the map definied in REmark 2.14 is indeed a homomorphism. 


20 Verify the statements in Remark 2.14. 


2.10 NORMAL AND SIMPLE GROUPS 


We first summarize the many equivalent ways to represent the normal property for 
a subgroup, most of which was already proved in Section 2.9. 


Lemma 2.11 The following are equivalent for a subgroup H of a group G: 
1. HAG. 
2. For allg € G we have g"'Hg CH. 
3. For allg € G we have g-'Hg = H. 
4. For allg € G we have Hg © gH. 
5. For allg € G we have Hg = gH. 


Example 2.39 We now give some examples of normal subgroups. All of these ex- 
amples have been previously verified as being subgroups, so we need only check the 
normality property. 


1. In any group G the trivial group and the entire group are always normal sub- 
groups. Note that G/{1} ={{g} : 9 © G} =G and G/G = {G} is isomorphic 
to the trivial group. 


2. Any subgroup of an abelian group is normal. 


3. SIn(R)<GL,(R), since for A € GL,(R) and B € SL,(R) we have |A~'BA| = 
|A|~?|B||A| = | A]? |A]|B] = |B] = 1 and so A1BA € SL,(R). 


4. An<ISp, since ifo € S, and t € Ay, then o~'ro will always be a product of an 
even number of transpositions and so in An. Now forn > 2, [Spy : An] = 2 so 
there will be two cosets in the factor group S;,/An, namely the even permutations 
A, and the odd permutations (1 2)A,. Since there is only one group up to 
isomorphism of order two we also know that S;,/An = Zo. 
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5. For any group homomorphism ¢: G— G' it is always the case that ker@ AG, 
since for allg € G andk € ker@ we have 


og *kg) = o(9-*)O(k)O(9) = O(9) *V9(9) = V, 
and so g~'kg € ker@. 


6. Z(G) AG, since for allg € G and z € Z(G) we have g°''zg = zg ‘g=z€ 
Z(G). 


Next we present a wonderfully surprising and enlightening result related to the 
center of a group. 


Lemma 2.12 If G/Z(G) is cyclic, then G is abelian. 


Proof 2.25 Set Z = Z(G) and we are assuming that G/Z = (gZ) for some g € G. 
Take any a,b € G. Since aZ,bZ € G/Z this implies that aZ = (gZ)™ = g™Z and 
bZ = (gZ)”" = g"Z for some integers m and n. Therefore, a € g™Z andb € g"Z and 
so a=g"z, and b= g"z for some 21, z2 € Z. But then 


—_ om, on, _ p.~mon _— am+n _— ~n+tm _ onm TAA 3 
ab=Qhaga2= 9" Pam = Gan =P = Gg" 22m = Gg 2g" A = ba. 


We now introduce a family of groups that can be considered the building blocks 
of groups in a similar way to which prime numbers are the building blocks for natural 
numbers. 


Definition 2.28 A non-trivial group G is called simple if it has no non-trivial 
proper normal subgroups. 


Hence, you can in addition view normal subgroups as group factors (and hence 
the name factor group for the collection of corresponding cosets). To summarize, a 
simple group is a group whose only normal subgroups are the trivial subgroup and 
the improper subgroup, just as a prime number has factors only 1 and itself. 


Example 2.40 For any prime p the group Zy is simple. To see this, suppose H (Zp. 
Then in particular H < Z, and so |H| divides |Z,| = p. Since p is prime, this implies 
|H| = 1 orp, i.e. H =1 orZ,. In Theorem 2.12, we show that these are, in fact, the 
only abelian simple groups. 


Theorem 2.12 If G is an abelian simple, then G = Z, for some prime p. 


Proof 2.26 Suppose that G is abelian simple. First note that as such G has no non- 
trivial proper subgroups, since in an abelian group every subgroup is normal. First, 
we show G must be cyclic. Indeed, take any g 4 1 in G and consider the subgroup (q). 
Since g #1 it must be the case that (g) = G, and thus G is cyclic. Second we note 
that G must be finite, for if G were infinite, then (g?) would be a non-trivial proper 
subgroup of G. Hence, |G| =n < co, and so G = Z,, (see Theorem 2.6.1). Finally, 
n must be prime, for otherwise n would have a proper non-trivial positive divisor 
d#1,n which in a cyclic group implies there exists a non-trivial proper subgroup of 
G of order d (see Theorem 2.1.1i). 
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Simple groups come in several varieties. There are first of all the abelian simple 
groups which we just classified. The non-abelian simple groups for the most part 
are clustered into families. We shall explore some of these families later in the text, 
namely A, is simple for n # 1,2,4 and SL,(K)/Z(SL,(K)) where K is a field 
(most of the time). The latter simple groups are one family of simple groups arising 
from a larger family of simple groups of Lie type. Then there are the non-abelian 
simple groups which do not fall into any family of simple groups. These miscellaneous 
simple groups are called the sporadic groups. The largest of these groups is called 
the monster group (the second largest is called the little monster) and has order 


Oh 0 Be OL oP 1 O82 20 Sle Alot e504 71 8 RST" 


EXERCISES 


1 Prove that in any group G the trivial group and the entire group are always 
normal subgroups. 


2 Let G be a finite abelian group such that p divides |G|, but p? does not divide 
|G|, for some prime p. Set N={geEG : g?=1} 


a. Prove that N dG. 


b. Prove that G/N has no elements of order p. 
3 Prove that if H, kK <dG and K < H, then the following is a well-defined homo- 
morphism: 


@:G/K > G/H by o(gK) = gH. 


4 Show that if |G| = p? where p is a prime and |Z(G)| > p?, then G is abelian. 


2.11 THE GROUP ISOMORPHISM THEOREMS 


In this section, we explore some of the fundamental structural properties of quotient 
groups. We have already seen such a result when we presented the Fundamental The- 
orem of Homomorphisms. This result is, in fact, a portion of the First Isomorphism 
Theorem which we shall see in this section. There are three Isomorphism Theorems 
which we present herein. First, we need a preliminary result. 


Lemma 2.13 Let N be a normal subgroup of a group G. 


1. If H is a subgroup of G such that N < H < G, then H/N is a subgroup of 
G/N. If in addition H is normal in G, then so is H/N normal in G/N. 


2. If H is a subgroup of G/N, then there is a subgroup H of G with N<H<G 
and H = H/N. If in addition H is normal in G/N, then so is H normal in G. 


Proof 2.27 It is easy to see that H/G < G/N, since for hiN,h2N € H/N we 
have (h1N)(hgN)~! = (hihy')N € H/N because H < G. Assuming H 1G we get 
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H/N «G/N since for hN € H/N and gN € G/N we have (gN)"\(hN)(gN) = 
(g-thg)N € H/N, because HG. 

For the second statement, let H < G/N. We need to construct H which will 
satisfy the conclusions of this result. To this end, set H ={gEG : gN € H}. First 
note that H < G, since H = v~'(H) (recall v is the canonical map which sends g 
to gN) and we know the inverse image of a subgroup is again a subgroup. Second, 
H contains N since N = kerv and N € H. Finally, we show that H = H/N. If 
gN € H, then v(g) € H and sog Ev 1(H) =H. Hence, gN € H/N. IfgN € H/N, 
then g € H which implies v(g) € H and so gN € H. 


The following illustrates how Lemma 2.13 can be applied to prove nice results. 
But first we prove another lemma. Note that the proof of this lemma will illustrate 
how, in the case of finite groups, we can use factor groups and induction to achieve 
a result. This is a classic type of argument in finite group theory. 


Lemma 2.14 If G is a finite abelian group whose order is divisible by a prime p, 
then G must have a element of order p. 


Proof 2.28 We use induction to prove this lemma, t.e. assume that all groups of 
smaller order than G have the desired property of this lemma (note that the lemma 
holds vacuously for the trivial group). If G is also cyclic, then the result follows from 
an earlier result on cyclic groups(Theorem 2.1.1). So assume that G is not cyclic. 
Take any1#g€G and set H = (g). Since G is assumed not cyclic, we know that 
H is a proper subgroup of G. Note that |G| = |G/H||H|, so since the prime p divides 
|G|, it must divide |G/H| or |H|. If p divides |H|, then by induction H, and therefore 
G, has an element of order p and we are done. However, if p divides |G/H| we have 
a bit more work to do. Again, by induction, G/H has an element, say aH, of order 
p. This means (aH)? = H and aH ¢ H or equivalently a? € H anda ¢ H. Consider 
the map @: H + H by o(h) = h? which is a homomorphism, since G is abelian. 
Now if the kernel of @ were nontrivial, then there would an1 #h € H with o(h) = 1. 
Then this h would be an element of order p and we would be done. So assume the 
kernel of @ is trivial. This implies that @ is one-to-one and so, since H is finite, @ also 
maps onto H. Therefore, in particular, there is an h € H which @ maps onto a?, i.e. 
hP = a?. But then ah~' is an element of G of order p, since (ah~')? = aP(h?)-“' =1 
and ah~! #1 (since otherwise a=h, but h € H while a ¢ H). 


Corollary 2.3 If G is a finite abelian group and d divides the order of G, then G 
has a subgroup of order d. 


Proof 2.29 Lemma 2.14 does most of the work for us. Once again we prove this 
result by induction on the order of the group. If G is trivial, then the result follows 
immediately (d = 1 only). Assume that G is non-trivial and let p be a prime divisor 
of d. By Lemma 2.14, there exists g € G of order p. Set K = (g), a subgroup of order 
p. Consider the factor group G/K which has smaller order than G. Since d/p divides 
the order of G/K, by induction, there exists a subgroup H < G/K of order d/p. By 
Lemma 2.13, there exists a subgroup H of G with K < H< Gand H =H/K. But 
then H is the subgroup we seek, since |H| = |H||K| = (d/p)p = d. 
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Theorem 2.13 (First Isomorphism Theorem) The following statements are 
true: 


1. Fundamental Theorem of Homomorphisms: /f ¢:G,— G2 is a group 
homomorphism, then G,/ker(@) = 6(G). If in addition @ is an epimorphism, 
then Gi /ker(¢) = Go. 


2. Correspondence Theorem: For any normal subgroup N of a group G there 
is an inclusion preserving one-to-one correspondence between (normal) sub- 
groups of G containing N and (normal) subgroups of G/N. 


& Ifo: Gy > G2 ts a group homomorphism, there is an inclusion preserving 
one-to-one correspondence between (normal) subgroups of G containing ker(@) 
and (normal) subgroups of G/ker(@). 


Proof 2.30 The first statement is the Fundamental Theorem of Homomorphisms 
which we have already proved (see Theorem 2.10). 

To show the second statement, set H to be the subgroups of G containing N and 
H to be the subgroups of G/N. Define the map f :H > H by f(H) = H/N. By 
Lemma 2.13, we know that f maps both into and onto H when H is simply a subgroup 
or when H is a normal subgroup. We now show that f is inclusion preserving. First, 
if we take Hy, Hy © H with Hy C Hg, then certainly H,/N C H2/N and so f(Hy) C 
f(H2). Second, if we have f(Hi) C f(H2), then Hi/N C Ho/N. So for hy € Ay 
we have hiN € H2/N and so there is an hz € Hz such that hiN = h2N. Hence, 
hyhz! € N C Hy which implies that hy € hgH2 C Ho. To show f is one-to-one, 
notice that if f(H1) = f(H2), then certainly f(Hi) C f(H2) and f(H2) C f(A). 
Therefore, by the work above, we get H, C Hz and H2 C Hy and so Hy = Ho. 

The third statement follows from the second with N = ker(@). 


Example 2.41 In the following examples we illustrates aspects of Theorem 2.13. 


1. Consider the group Z with normal subgroup 8Z. Figure 2.19 illustrates the cor- 
respondence between the subgroups of Z containing 8Z and the subgroups of 
Z/8Z. 


2. Consider the quaternions Qg = {+1, 41,47, +k} and the normal subgroup N = 
{+1}. Figure 2.20 illustrates the correspondence between the subgroups of Qs 
containing N and the subgroups of Qs/N. 


Theorem 2.14 (Second Isomorphism Theorem) /f G is a group with H <G 
and N <G, then 


1. HN<G. 
2 HANH. 


39. HNIN~ H/HON. 
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Z Z/8Z 
2Z 2Z,/8Z 
| 
AZ, AZ,/8Z 
| | 
8Z, 8Z,/8Z 


Figure 2.19 First example illustrating the Correspondence Theorem. 
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Figure 2.20 Second example illustrating the Correspondence Theorem. 


Proof 2.31 To prove the first statement, by Theorem 2.5, it is enough to show that 
HN = NH. Now this is evident since N <G which implies, in particular, that for 
any hé H we have hN = Nh. 

The second statement is simply a verification of normality, since we know that 
HON <G. Therefore, take anh € H andanxe HON. Since h,x € H we have 
h-!zh € H and sincex € NAG we also have h~!xh € N and soh"zhe HON. 
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For the third statement, first note that since NA HN and HNN <H the quotient 
groups HN/N and H/H1N make sense. Define the map ¢: H — HN/N by 
o(h) =hN. The map ¢ is certainly a homomorphism, since 


b(hihz) = (hihe)N = (hi N)(h2N) = o(h1)o(h2). 


In addition, @ is an epimorphism, since for any (hn)N € HN/N we have 
(hn)N = AN and so o(h) = AN = (hn)N. Finally, we show the kernel of @ is 
HON. Indeed, h € H is in the kernel of d iff O(h)=N iff AN=N iff hEN 
and sohe HNN. Now we invoke the Fundamental Theorem of Homomorphisms to 
get H/HON = H/ker(¢) = HN/N. 


In this final Isomorphism Theorem we see that quotient groups act very much 
like fractions. 


Theorem 2.15 (Third Isomorphism Theorem) /f G is a group with H,K dG 
and K CH, then 


1. H/KAG/K 
2. (G/K)/(H/K) = G/H. 


Proof 2.32 Consider the map ¢: G/K > G/H by o(gK) = gH. We first need to 
verify that @ is a well-defined mapping. To see this take g.K = goK which implies 
ng’ € K CH and so yH = gH, i.e. O(g,K) = ¢(g2K). Certainly, ¢ is an 
epimorphism. Next we show that the kernel of ¢ is H/K. The coset gK is in the 
kernel of 6 iff O(gK) =H iff gH=H iff g¢H iff gk € A/K. This gives 
us the first statement, since H/K = ker(¢) (G/K and the second statement follows 
from the Fundamental Theorem of Homomorphisms. 


We will later illustrate the importance of these Isomorphism Theorems when we 
prove some results about solvable and nilpotent groups. 


EXERCISES 


1 As was done in Example 2.41, illustrate the Correspondence Theorem for each 
of the following groups: 


a. The group G = Zig and the normal subgroup H = {0,6, 12}. 
b. The group G = Zi, and the normal subgroup H = {1,3, 9}. 


c. The group G = Dy, and the normal subgroup H = {1, pe}, where p is a 180° 
rotation of the square. 


2 Let G be a finite group. 


a. Show that if H < kK <G, then [G: Kk] < [G: H]. 
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b. 


Let H < G and define H° = {H9 : g € G}. Let f : H? —> G/Na(F#) by 
f(H9) = gNe(A#) ( recall that Ne(H) ={g € G : H9 = A} ). Show that 
f is a well-defined one-to-one map (and so, it maps onto G/Ng(H) as well). 
Use parts a. and b. to show that if [G@ : H] < oo, then |H®| < 00. 


Let H,K < G and define f : H/(H NK) — G/K by f(h(HN K)) =AK. 
Show that f is a well-defined one-to-one map (and so, it maps onto G/K 
onto as well). 


Let H,K < Gand [G: H],|G: K] < ow. Use part d. to show that [G : 
HK] <o. 


We have seen that N = (\j¢g H% is normal in G. Use parts c. and e. to show 
that if H < G and [G: H] < ow, then [G: N] < oo. 


CHAPTER 3 


simple Groups 


N THIS CHAPTER, we shall look at two families of non-abelian groups: The alter- 
nating group and the projective linear group. We will prove that they are both 
families of simple groups. 


3.1. THE ALTERNATING GROUP 


The first family of simple groups, which is the topic of discussion in this lesson, is Ay 
with n = 3 or n> 5. First, we prove a useful lemma. 


Lemma 3.1 If X is the collection of all 3-cycles in S, for n > 3, then Ay, = (X). 


Proof 3.1 First note that any 3-cycle (k | m) = (k m)(k 1) € An and so X C 
A, Since (X) is the smallest subgroup containing X, we have that (X) C Ap (see 
Exercise 61. in Section 2.2). 

To show the reverse inclusion is suffices to point out that (k m)(k 1) = (k lm) 
and (k m)(n l) = (kl n)(k mn), since this implies any product of an even number 
of transpositions can be pairwise rewritten as 3-cycles and thus in total as an element 


of (X). 
Theorem 3.1 Jfn41,2,4, then Ay is simple. 


Proof 3.2 First, we dispense with the small alternating groups: A, = 1 which is 
not simple, Ag =1 which is not simple, and A3 ~ Z3 which is simple. Lastly, Ag is 
not simple, since one can check that the following is a normal subgroup of Aq: 


N = {1,(1 2)(3 4), (1 3)(2 4), (1 4)(2 3)}. 


Now assume that n > 5 and we show that Ay is simple. Suppose, to the contrary, 
that we had a proper normal subgroup N <1 Ap. 


Claim 3.1 N has no 3-cycles. 
Suppose, to the contrary, that there was a (k lm) € N. Now take any (k' I'm’) € 
A, (see Lemma 3.1) and let 0 € S;, be such that 


o(k’)=k, ofl!) =I and o(m’') =m. 
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Then (k'  m') = o'(k l m)o. Therefore, if ¢ € An, we would then have 
(k’ U m’') € N, since N < An. If o ¢ An, then choose T = (r s) € Sy with 
r,s € {k,l,m} (here is where we need the fact that n > 5). Then, since disjoint 
cycles commute, we have 


(ki Um’) =o07\(k l m)o = 0 '1(k bl m)to = (Ta) (kL m)(T0) EN. 


So, in either case we have this arbitrary 3-cycle (k' lm’) € N. Hence, N contains 
all the 3-cycles, which implies by Lemma 3.1 that N = (X) = An, a contradiction. 


Claim 3.2 Anyo € N has a disjoint cyclic decomposition made up entirely of cycles 
of length < 3. 

Suppose, to the contrary, that this claim were false and take anyo € N. Then 
the disjoint cycle decomposition of 0 would include an s-cycle with s > 3, say T = 


(kimr---). Set p=(mlk)e A, and 
t=p 'tp=(kilm\(klmr---\(mlk)=(lmkr--:), 


which differs from t only in the first 3 places. Note that p-'op € N <A, and has a 
disjoint cycle decomposition which differs from o in that T is replaced by tr’. Therefore, 


(p ‘op)o'=r'r *=(Imkr---)(---r milk) =(ir k). 
But then (lr k) € N which contradicts the first claim we proved. 


Claim 3.3 Any ao € N has a disjoint cyclic decomposition made up entirely of 
transpositions. 

Suppose, to the contrary, that this claim were false and take 0 € N having 38- 
cycles in its disjoint cycle decomposition. Then using the second claim, the disjoint 
cycle decomposition of o would include at least two 3-cycles, say (k | m)(k' Ul! m’). 
Indeed, if o contained a only single 3-cycle (k lm), then o? € N would equal a 
single 3-cycle (k | m)(k | m) = (k ml), contradicting the first claim. Set o' = 
(ki! m)~'o(k' Um) € N <Any. Notice that 


of = (mU KY (ke Lm) (KU ml) (Rl Um) + = (inl m RYE LT), 
so that 

oot=(m mE \kKLU)(mi km UR)=(kk ml m)EN, 
but this contradicts the second claim. 


Claim 3.4 Any ao € N has a disjoint cyclic decomposition made up entirely of 
transpositions which uses all the elements of {1,2,...,n}. 

Suppose, to the contrary, that this claim were false. Then there is aa € N and 
an x € {1,2,...,n} which does not show up in the disjoint cyclic decomposition of o 
into transpositions. Say o = (k l)\(mr)--- and seto’ = (la k)-to(lxk)€ N<AAg. 
Note that 


G=(kelo(lak)=(kab(kDilak)(mr)---=(@k)(mr)---, 
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which differs from o only in the first transposition. But then 
go=(@hikh=hiazyen, 
but this contradicts the first claim. 


Having established these four claims, we can now achieve a final contradiction 
to the assumption that Ap, is not simple. Take anya € N< Ap. Since o is even 
and the fourth claim, we know o = (k I)(m n)(r s)--- and n = 6,8,10,.... Set 
ao’ =[(m r)(k m)|~to[(m r)(k m)] € N <d An. However, 


o =(k m)(m r)(k I)(m n)(r s)---(m r)(k m) = (k s)(l m)(r n)---, 
which differs from o only in these first three transpositions so that 
ola = (k s)(t m)(r n)(k D(m n)(r 8) = (km r)(l 8 n), 
but this contradicts the third claim, since o'a € N. 


To prove the next result, which is a consequence of the theorem just proved, we 
first need to point out the Z(S;,,) = 1 for n > 3. Indeed, take any o 4 1 in S,. As 
such, there is an 7 with o(i) #7. Since n > 3, we can construct a7 € S, with 7(7) =i, 
T(o(i)) = 7 where j 4 i, a(t). Notice then that 


ot(t) =o(t) while ro(i) = 7 Z oft). 
Thus, ot # To and so o ¢ Z(S;,). 


Corollary 3.1 [fn #4 4, then the only proper non-trivial normal subgroup of Sy is 
An. 


Proof 3.3 As in the proof of Theorem 38.1, one sees that the statement is true for 
n= 1,2,3 and false forn = 4. Therefore, we assume n > 5. First note that certainly 
An <I Sp, since |S, : An] = 2 (see Exercise 14 in Section 2.9). Now take any NS), 
with N #1. Then NN Ay < An and so by Theorem 3.1, either NO An = An or 
NN Ay = 1 which in turn implies either An < N or NM Ay = 1. In the former case, 
since |S,,| = 2|A,| either |N| =|A,| or |N| =|S,| and so either N = A, or N = Sy. 

It suffices to show the latter case when NO Ay, = 1 is not possible. Since both An <I 
Sn and N <1S, we know that A,N < S,,. And since |A,N| > |A,| and |S,| = 2|A,|, 
it must be that |A,N| = |S,| and so S, = A,N. Furthermore, since NN A, = 1, 
we have 2|A,| = |S,| = |AnN| = |An||N|, by Proposition 2.3, and so |N| = 2. Let 
1ATEN. Since N<S;, this means for allo € Sy, we have o~'to = T. But then 
To =To and sot € Z(S,) = 1, yielding a contradiction. 


Remark 3.1 Now that we have shown A, to be simple forn #1,2,4 we can exhibit 
a family of groups which illustrate the fact that if d divides the order of a finite group, 
there does not necessarily exist a subgroup of order d. The family of groups are An 
for n > 5. Each of these groups has order divisible by n!/4, yet has no subgroup of 
order n!/4. Indeed, if it did, then the index of that subgroup in A, would be 2 thus 
making it a normal subgroup of An, contradicting the fact that A, is simple. 
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EXERCISES 


1 Verify that the following is a normal subgroup of Ag: 


N = {1, (1 2)(3 4), (1 3)(2 4), (1 4)(2 3)}. 


3.2 THE PROJECTIVE LINEAR GROUPS 


In this section, we shall look at second family of simple groups. This family is called 
Projective Linear Groups which are factor groups of certain matrix groups. The reader 
is forewarned that some of the topics in this section do not necessarily flow from earlier 
sections. These topics include fields, finite fields which will be discussed later in the 
text; topics in linear algebra such as bases and vector space linear transformations; 
as well as knowledge of the commutator subgroup presented later on in the text. 


Definition 3.1 Let K be a field either infinite (this includes the rational, real and 
complex numbers), or finite (in which case |K| = q where q = p* with p a prime 
number and k a natural number). 


1. The n-by-n general linear group over a field K, written GL,(K) is the 
collection of all invertible n x n matrices with entries from K. 


2. The n-by-n special linear group over a field K, written SL,(K) is the 
collection of all invertible n xn matrices with entries from K with determinant 
Ls 


3. The n-by-n upper triangular matrices over a field K, written B,(K), are 
the collection of matrices A = [a;;| with entries from K with ay = 0 ifi> j. 


4. The n-by-n unipotent matrices over a field K, written U,(K), are the 
collection of upper triangular matrices entries from K which have 1’s on the 
diagonal. 


5. The general linear group of a vector space V over a field K, written 
GLy(K), is the collection of all vector space automorphisms of V. 


6. If |K| =q where q= p* with p prime, then for the above structures we employ 
the notation GLy(q), SL£n(q), Bn(q), Un(q) and GLy(q). 


For our purposes, the vector space V is assumed to be finite dimensional, say 
of dimension n. The linear algebraic fact that there is a one-to-one correspon- 
dence between vector space automorphisms and their matrix representation with 
respect to a fixed basis leads to the fact that GL,(K) = GLy(K). Note also that 
GL,(K)/SL,(K) ~ K* via the homomorphism which takes an element of GL,,(K) 
to its determinant (and the use of the Fundamental Theorem of Homomorphisms). 
We now give some facts in the case when |K| = q. 
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Theorem 3.2 Suppose K is a field with q elements where q = p* and p prime, then 


1. |GLn(q)| = (g" — Iq" — 49) + (gq? — 9"). 

2. |SEn(q)| = (@" — 1)(@" — @)--- (@" — a") /(a— J). 

3. Un(g)| = rer”. 

4. qr-V/? és the largest power of p which divides the order of GLn(q). 


Proof 3.4 For the first statement, we will, in fact, compute the size of GLy(q) and 
appeal to the fact that GL,(q) = GLy(q). First fix a basis for V, say v1, v2,.--,Un- 
Recall that a linear transformation is completely determined by where it sends the 
basis V1, V2,...,Un and to be an isomorphism it should send the basis v1,v2,...,Un 
to another basis, say W1,W2,...,Wn, for V. Hence, computing the size of GL, (q) is 
reduced to counting the number of distinct bases for V. For our first vector w, in any 
basis for V, we have q" —1 choices (since we cannot choose the zero vector). Now the 
span of w, contains q vectors. So to choose our second linearly independent vector 
wz, we must choose outside of the span of wi, hence leaving us with q” — q choices. 
The span of w,,w2 contains q? vectors, so we have q” — q? choices for w3, and so on. 
This argument can be made more formal using induction. Therefore, the number of 
bases for V, and hence the size of GL,(q), is (q” —1)(q” —q)-+:(q" —q""4). 
For the second statement, notice that 


q—1=|K"| = |GLn(q)/SLn(Q)| = |GEn(Q)l/|SLn(Q)1, 


and so |SLn(q)| = |GLn(q)|/(a — 1) = (g" - 1)(a" -@)--- (Q? -—@""")/(G- I). 


The third statement is a simple counting argument. The number of entries above 
the diagonal in ann x n matriz is 1+2+-+--+(n—1) =n(n—1)/2 and the number 
of choices for values in each of these entries is q. 

For the fourth statement, since Un(q) < GLn(q) we have qr"-Y/? = |Un(q)| 
divides |GL,(q)|. An elegant way to show this is the largest such power of p requires 
Sylow Theory (which we haven’t covered yet), however another way to see part iv is 
to notice that 


IGLn(q)| = (" — Iq" -— 9) (q" — a") 


( 
=q¢ gq’ \(qr-1 
1 


gee re) (q” 


= Ph VP — 1g" -1)---(q-1), 


and so we see that q’\"—/? divides |GLy(q)|. Furthermore, the remaining factors of 
|GLn(q)|, namely (q” —1),(q’-!—1),...,(q—1), are all not divisible by the prime 
p and so p cannot divide the product. 


The next result states that the center of invertible matrices consists of scalar 
matrices. This center is an ingredient in the family of simple groups we wish to 


define. 
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Theorem 3.3 Each scalar matriz aI with a # 0 commutes with all the elements 
of GL,(K) and any element of GL,(K) which commutes with all the elements of 
GL,(K) must be a scalar matrix al with a £ 0, i.e. 


Z(GLn(K)) = {aln : ae K*}. 


Proof 3.5 For any matrix A and scalar matrix al, we have A(al) = a(AI) =aA= 
(al)A, so then 
{al, : a€ K*} C Z(GL,(K)). 


Now suppose B is a matrix in GL,(K) which commutes with every element of 
GL,(K). Choose any v1 € K” and extend to v1, V2,...,Un @ basis for K”. For i = 
2,3,...,n, take A; € GLy(KX) to be the matrices having the property that Ajv; = v; 
for j #7 and Ajv; = v1 + vu; (note that A; maps a basis to a basis and therefore 
is invertible). Consider the subspace U; of K" consisting of the vectors which are 
fixed by A;. It’s easy to check that U; = span(v1,...,Ui-1, Vit1,---,Un). Note that for 
i = 2,3,...,n and every u; € U; we have that A;(Bu,;) = (A;B)u, = BA;u; = Bu; 
which puts Bu; € U; for alli. If we set U = (\f_. Ui, this in turn means that for 
uéU we have Bu € U. Now U = span(v) so that Bu, = a1v1 for some ay € K. 
Since v, is chosen arbitrarily in K”" we get that for any v € K” we have Bu = av for 
some a € K. In particular, for each i we have Be; = aje; for some a; € K (where 
€1,€2,---,€n ts the standard basis). This makes B a diagonal matrix. But notice that 
for any i # j, we have on the one hand that B(u; + v;) = Bu; + Bu; = av; + ajv; 
while on the other hand B(v; + vj) = a(uj + vj) = av; + av;, for some a € K. By 
unique representation for a basis we get a; = a and aj =a and so a; = a; fori Fj. 
Therefore, B= al for somea€ K and certainly a #0, since BE GL,(K). 


Definition 3.2 An element of SL,(K) is called a transvection if it is not the 
identity matrix yet it fixes pointwise some subspace of K” of dimension n—1 (called 
a hyperplane). 


Example 3.1 Any matrix of the form I, + aki; where a € K* andi # j isa 
transvection, where Ej; is a matrix filled with zeros except that the ij-th entry is 1. 


Remark 3.2 Some consequences of Theorem 3.8 are the following: 


1. It follows from the proof of Theorem 8.8 that Z(GL,(K)) ~ K* and if |K| = 
q < oo, then |Z(GL,(K))| =q-1. 


2. Z(SL,(K)) = {al, : a” = 1}, since we require that det(al) = 1. Therefore, 
if |K| = q < ov, then |Z(SL,(K))| = ged(n,q— 1). Note, this follows from a 
fact we will see later in the text that K* is a cyclic group of order q —1. 


3. Each A; as defined in the proof of Theorem 3.3 is a transvection of the form 
In + yy where i = 2,3,...,n. 


4. From the proof of Theorem 3.3, we see that a matrix in GL,(K) will com- 
mute with all the elements of GL,(K) iff it commutes with the collection of 
transvections of the form I, + Ey; where i = 2,3,...,n. 
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Definition 3.3 For any field K, 


1. The n-by-n projective general linear group over a field K, written 
PGL,(K) = GL (K)/Z(GIn(K)). 


2. The n-by-n projective special linear group over a field K, written 
PSLn(K) = SLn(K)/Z(SEn(K)). 


It follows then from our work in this section that if K is a finite field of order q, 
then 


PCL a(a)| = pee = (gt —)la" =a)" <a Mla 1) = [SE 
IPSL(a)| = peg = (gh = 1)lgh =a) (a =P Yq = Nged(ng = 2) 


Having introduced the family of groups PGL,(k) and PSL,(k) we look in 
detail at what makes many of these groups simple. First, we look more carefully at 
transvections. As we have seen already, it is the transvections that are the key to all 
our results about this family of groups. 


Lemma 3.2 Every transvection is conjugate to the transvection I, + Ey2. In other 
words, if A € SL,(K) is a transvection, then there exists a P € GL,(K) such that 
In + Eyg = P7'AP. Furthermore, if n > 3, then every transvection is conjugate to 
In + Ey. by an element of SL,(K). 


Proof 3.6 Let A be a transvection fixing pointwise some subspace U of dimension 
n—1. Using the linear algebraic fact regarding similarity of matrix representations, if 
we can find a basis for K” such that the matrix representation of A with respect to that 
basis is In, + Ey2, then we will have proved the lemma. Take any basis uy, u2,...,Un—1 
forU and anyv € K"—U. Now, we can express Av = ayuy,+-+-+@n—-1Un—-1t+av where 
Q1,---,4n-1,a€ K. Ifa, =--- = ay,_, = 0, then the matrix representation of A with 
respect to the basis u1,U2,...,Un—1,U is diag(1,...,1,a) and since A € SLp(K), 
this implies that a = |diag(1,...,1,a)| = 1. Thus, the matrix representation is the 
identity matrix I, which contradicts the definition of transvection. Therefore, it must 
be the case that some a; # 0. For this i with a; # 0, replace u; in our basis for 
U by the vector ayuy +--+ + Gn—1Un—1 € U which will be linearly independent from 
U1,+-+, UWi-1, Wi41;-+-;Un—1 (left as an exercise). Now re-label the basis for U so that 
ayy +-+++4n_-1Un_1 ts uy. Hence, Av = uy +av and then the matrix representation 
of A with respect to the basis u,v, U2,...,Un—1 ts diag(1,a,1...,1)+ E12. As before, 
a= 1andso the matrix representation of A with respect to the basis uy, VU, U2,.--,Un—1 
is In + Ey2 and we’re done. 

Let A be any transvection with n > 3. By the work above, there is a P € GL,(K) 
such that P-1AP = I, + Ex. Set D = diag(1,1,...,1,|P]|). One can check that for 
n> 3 we have D“'(In + Ey2)D = In + Eye 80 that (DP~')A(PD“) = I, + Eye and 
thus A is conjugate to I, + E,2 via PD~!. Now notice that |PD~!| = |P||P|-! = 1 
and so PD~' € SL,(K). 
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Corollary 3.2 All transvections are conjugate to each other in GL,(K). Further- 
more, ifn > 3, then all transvections are conjugate to each other in SL,(K). 


Proof 3.7 By Lemma 38.2, since every transvection is conjugate to In + E42, they are 
therefore conjugate to each other and ifn > 3 they are conjugate via an element of 


SL,(K). 


Theorem 3.4 Every element of SL,(K) can be expressed as a product of transvec- 
tions of the form I, +aEj;; where a € K* andi # j t.e., SLy(K) is generated by these 
transvections. Every element of GL,(K) can be expressed as a product of transvec- 
tions of the form I, +aE;; where a € K* andi 4 j and a diagonal matrix of the form 
diag(1,1,...,b) for some b € K* i.e., GIy (K) is generated by these transvections and 
diagonal matrices. 


Proof 3.8 Take any A = [aj] € SL,(K). Since A is invertible, we can row reduce 
it to the identity matriz. We need to show explicitly how we row reduce A to I, to 
prove the theorem. 

We may assume that ay2 # 0 (otherwise add some row i with ay; #0 to row 2). 
Now dag (1 — 41) times row 2 added to row 1 changes ay, into 1. Using this pivot 1 
and row operations of the form aR, + R; withi > 1 anda e€ K* we can put 0’s below 
the pivot 1. Now the (1, 1)th minor of the resulting matrix is in SLy_1(K) so we may 
repeat the process just mentioned on this minor. Continuing in this way using the 
same type of elementary row operations we can effectively row reduce A to In. 

Now notice that we did this using only elementary row operations of the form 
aR;+ R; witha € K* andi ¥ j. Recall from linear algebra that these elementary row 
operations corresponds to multiplication on the left by the elementary matrices of the 
form I, + aki; with a € K* andi j. Notice also that Un + ali;;)—* =e 
which is a transvection of the very same form. Therefore, again by linear algebra, 
I, = Ex:++E2E,A where each E; is of the form I, + aki witha € K* andi F j. 
But then A= Ey'Ex'---E,' where each E;' is of the form In + aEj; with a € K* 
andi # j. 

The proof for GL,(K) is identical, but we need to note that using only elementary 
row operations of the form aR; + Rj with a € K* andi ¥ j, we can only reduce an 
element of GL,(i) down to diag(1,1,...,0) with be K*. 


We now have enough machinery to classify the normal subgroups of GL,(K) and 
thereby show that almost all the groups PSL,(K) are simple groups. Here now is 
the main result from which this follows. 


Theorem 3.5 [f GL,(K) #4 GLo(2),GL2(3) and H < GL,(K) such that for all 
A € SL,(K) and B € H we have A'BA € H, then either SL,(K) < H or 
H < Z(GL,(K)). 


We shall delay the proof of this theorem for a bit and first look at some of the 
consequences. The reader should check that the above result is false when GL, (A) = 
GL2(2) or GLo(3). The first two corollaries are immediate and require no proof. 
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Corollary 3.3 If GL,(K) 4 GL2(2), GL2(3) and H AGL, (K), then SL,(K) < H 
or H < Z(GL,(K)). 


Corollary 3.4 If GL,(K) 4 GLe(2),GL2(3) and HA SL,(K), then either H = 
SL,(K) or H < Z(SL,(K)). 


Corollary 3.5 If GL,(K) 4 GL2(2), GL2(3), then GL, (K)’ = SL,(K). 


Proof 3.9 Since GL,(K)/SL,(K) = K* is abelian, we have that GL,(K)! < 
SL,(K). So then GL,(i)' a SL,(K) and thus, by Corollary 3.4, if we can show 
there is an element of GL,(K)! not in Z(SL,(K)), then we will be done. Take, for 
instance the commutator [In + E12, In + E21| which is not a scalar matriz. 


Corollary 3.6 If GL,(K) 4 GL2(2), GL2(3), then PSL, (K) is simple. 


Proof 3.10 Let N < PSL,(K) and write N = N/Z(SL,(K)), where N <A SL,(K). 
Then by Corollary 3.4, either N = SL,(K) or N < Z(SL,(K)) and so either 
N= PSL,(K) or N is the trivial subgroup. 


Remark 3.3 Let’s look at some of the smallest groups of the form PSL,(K). 


1. PSL2(2) and PSL2(3) are not simple. One can see this by either pointing out 
that their corresponding orders are 6 and 12 and there are no simple groups 
of these orders. Another way to see this is to show that PGL2(2) = S3 and 
PGL2(3) = Ag both of which are not simple groups. 


2. PSL2(4) and PSL(5) both have order 60 and one can show that the only simple 
group of order 60 is As. Hence, PSL2(4) and PSL2(5) do not give rise to any 
new simple groups. 


3. PSL2(7) has order 168 which is not the order of any alternating group. There- 
fore, PSL2(7) is the smallest new simple group of this type. 


4. PSL3(4) has order 20,160= 81/2 = |Ag|, however PSL3(4) is not isomorphic 
to Ag. This can be shown by looking at the element (1 2 3 4 5)(6 7 8) € Ag of 
order 15 and proving that PSL3(4) has no element of order 15. Hence, there 
are two non-isomorphic non-abelian simple groups of order 20,160. 


We now prove Theorem 3.5. 


Proof 3.11 Certainly, every H < Z(GLn(K)) has the property that for all A € 
SL,(K) and B € H we have A~'BA € H, so we may assume that H has an element 
outside of Z(GL,(K)). There are two cases for this proof which are each proved in 
very different manners — one highly computational and the other not as much. These 
two cases arise due to the fact that transvections are conjugate in SL,(K) only when 
n> 2. 


Case 1: n=2 
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First, we need to show that if H contains a transvection of the form Iz + akj; 
where a € K* andi j, then SLo(K) < H. So suppose I2+akj; € H where a € k* 
andi # j. We shall show then that every transvection of the form Iz + aEj; where 
a€ K* andi #j is in H and hence, by Theorem 3.4, we will be done with the case 
of n = 2. First note that it is enough to show that every transvection of the form 
I, + ak. € H where a € K*, since by assumption we would then have 


[a t}-[2 a} [ot] [t ale 


Therefore, let’s assume some matrix of the form ; 1 


| € H witha € Kk*. 
Notice then that for any b € K*, we have 


fo ]= [os] Lo s]Le" see 


Hence, for all b,c € K*, we have 


1 ab? ti “oe? ee) 1 a(b? —c?) 
0 1 0 1 | 0 1 : 


Now in the case that char(K) 4 2, the equation a(b? — c?) = d has a solution 
in b and c for any choice of d € K*. Indeed, the solution is b = 2~'(a~'d+ 1) and 
c = 27-'(a-'d—1). Note this does not work for char(K) = 2 for then 2 = 0 which 
has no inverse. So we are reduced to proving this case of n = 2 when char(K) = 2. 

In the case that char(KX) = 2, first note that K* must contain an element d with 
the property that d* # 1, for otherwise |K| = 3 or 5 (since K* is cyclic). If we set 
b = ad’, then by the work above 


1 b des Oa la G0 
F 1] eH be coninatin | Ace Hh 


If we set 
e=a‘(1+d") =ala 7(1+d *)] = ala? + (a 'd)"] = a(a | +0)’, 


then again by the work above 


Ie *G ; ‘ 1a 
fe 1 | € H by conjugating E i by 0 a7! +a7d 


Before we continue, note that it is easy compute (exercise) that abc = a + b, 
be =14+ d? andac=1+d~? which we will need in a moment. As we saw above, we 


know that 
1 0 1 0 ; 
sts Le | and uh, 4 | are in H, 
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thus also in H is 


1 0 Lae Le O eM 1 —be c _ | -@ ¢ 
—a l 0 1 —b 1| | abe—a-—b 1l-ac| Or. sag? )| © 


Now take any e € K* and set x = e(d~* —1)7!. Notice that also in H is 


Payers 


Furthermore, one can easily compute that 


Lo" i] fo a] fe[o ty) 


Therefore, any matrix of the form 5 ; | with e € K* is in H, thus finishing 


the case of characteristic 2. 

Recall, in the beginning of the proof that we chose A € H—Z(GL,(K)). In linear 
algebra, we know that A is similar to a matrix in rational canonical form. Let us 
first consider the possibilities for this form. A might have two linear invariant factors 
a 0 
0 6b 


t—a, t—b in which case the rational canonical form would be , but since 


; a . Furthermore, a? # 1, since A is 


not in Z(SLo(K)). The other possibility is that A has a single quadratic invariant 


A € SLo(K) it must, in fact, be C = 


factor t? — at — b in which case the rational canonical form would be : : | but 


b 
i oe : 4 0 1 
again since A € SL2(K) it must, in fact, be C = a . Let P € GL2(K) be the 
matrix by which we conjugate A to put it in rational canonical form. Let’s consider 
each case separately. First, suppose that C = ‘ st | and set B= ; ; } 
. 1 l-a? 
Consider the commutator |C, B| which one can compute to be 0 1 . Since 
a? # 1, this implies that [C, B] is a transvection of the form Iz+(1—a~?)E\2. Notice 


also that 3 
[C, B] =[A?, B] = A-P-AP? = (a1ae" ) e HP. 


1 —v? 
Os at 


. and for anyb € K* set B= 


Now suppose that C = 
1 


_ 72 
oe | i is in H” for all b € K*. Conjugating [C~', B] 


as above [C~', B] = 
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i b-t —p-t 0 1 

0 b —-1 2+ 
in H? — this follows, since H? has the same property as H does, i.e. A € SLn(K) 
and B € H? we have A7'BA € H?, since B = D? for some D € H and so 


which must also be 


| € SL,(K) yields the matriz 


A BA= (Cae par" c HP. 


Then for allb € K*, H? contains the matrices 


ic (i a (a sa 
—-1 2+1*|  |0 1 


Therefore, H® will have a transvection unless b+ = 1 for all b € K*, but as 
we saw above this could only occur if |K| = 3 or 5. So we need another strategy 
for finding a transvection in SL2(5) (the group SL2(3) is ruled out by assumption). 

1 


From our above work we know that a 3 € H? and also the commutator of 


0 1 
at Dab 


-1 
0 il 1 -2 1 -2 _ ; : 
<A | and 01 | which equals 2 9 | when q = 5. Conjugate this 


result by = = | € SL2(5) to get ee : 


ie aly 


which is a transvection in H?. Thus, in every case, H? has a transvection of the 
form Ip + aE \2 where a € K*. Since, as we saw above, H? has the same property as 
H does, i.e. AC SLn(K) and B € H? we have A7'BA € H?, by our earlier work, 
SLo(K) < H? and so SL,(K) = SL,(K)?' < H. This completes the first case. 


| € H?. And lastly, note then that 


Case 2: n> 2 


Recall again our choice of A € H — Z(GL,(K)). By Remark 8.2.4, there must 
be a transvection T € SL,(K) which does not commute with A. Set B = |[A,T] € 
GL,(K)' = SL,(K). Note also that B= A~!A? € H and by assumption, B 4 In. 


Claim 3.5 H contains a transvection 

If B were a transvection we would be done, so we may assume that B is not a 
tranvection. Let U be the n —1 dimensional subspace of K” fixed pointwise by the 
transvection T~4 and W is then—1 dimensional subspace of K” fired pointwise by the 
transvection T, then B = T-AT fixes pointwise UAW, a subspace of K” of dimension 
at least n— 2. Now, if dim(UNW) =n-—1, then B € H is a transvection, contrary 
to our assumption. Hence, it must be the case that dim(UNW) = n—2. Set U' equal 
to the kernel of B—I, and W' the image of B—I,, and so dim(U') + dim(W’) = n. 
Since B is not a transvection, it must be the case that dim(U') < n— 2. Since 
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dim(U NW) = n— 2, this implies that dim(U’) = n —2 and so dim(W’) = 2. Let 
W” be a subspace of K” of dimension n—1 containing W’ (here we use the fact that 
n > 2). Take any w € W" and write Bw = (B-—In)wt+tweWwW'+Ww”" CW". 
Therefore, B fixes setwise W". Let C be a transvection which fixes W" pointwise. 
Then C~® is also a transvection fixing W" pointwise. Now [B,C] = C~®C fires W” 
pointwise as well, but could be the identity. Note that [B,C] = B~'B® € H, so if 
[B,C] were a transvection we would be done. So we may assume that [B,C] = I, 
for all tranvections C which fix W" pointwise (we will show this is not possible). 
Since B is not a tranvection, there must be aw € W" such that Bw 4 w and yet 
Bwew". Firav € K"—W" and select the transvection D which fixes W" pointwise 
with the additional property that Dv = w+v. By our assumption, |B, D] = In, and 
so BD = DB. In particular, BD(v) = DB(v). Since (B — I,)u € W' C W" this 
implies that (B —I,)v = w' for some w' € W" and so Bu = v+ w'. Therefore, 
DB(v) = Dwt+w') =wtvt+u' while BD(v) = B(wt+v) = Buwt+vu+w' and so 
Bw =vw, a contradiction. 


Having proved this claim, since all transvections are conjugate in SL,(K) it fol- 
lows that H contains all the transvections. Since SL,(K) is generated by the transvec- 
tions we then have that SL,(K) < H. 


EXERCISES 


1 Verify that any matrix of the form J, + aE; where a € K* andi # 7 is a 
transvection, where E;; is a matrix filled with zeros except that the ij-th entry 
is l. 


2 Verify in Lemma 3.2 that ayuzy +--+ + an—1Un_1 € U is linearly independent 
from Uy,...,Uj—1, Wit1,---,Un—1- 


3 Verify in Lemma 3.2 that D~!(I, + Ey2)D = In + Exp. 
4 Verify that the result of Theorem 3.5 is false when GL,(A’) = GL(2) or GL2(3). 


5 In the proof of Theorem 3.5 verify that abc = a+b, be = 1+d? and ac = 1+d~? 


CHAPTER 4 


Group Action 


N THIS CHAPTER, we present a powerful idea in group theory which is group action. 

The consequences of group action are many and the applications are incredibly 
useful as we shall see in the later sections in this chapter. Indeed, if someone wants 
to see concrete applications in group theory, then group action certainly fits the bill. 
In Section 4.1, we introduce the notion of group action. In Sections 4.2 and 4.3, we 
see a very nice application of group action in the field of combinatorics. Section 4.4 
presents some deep theoretical consequences which follow from group action. The 
important Sylow Theorems in Section 4.5 follow from group action. We then use the 
Sylow Theorems to classify finite groups of particular sizes in Section 4.6, which in 
turn assists us in classifying completely finite abelian groups in Section 4.7. 


4.1 GROUP ACTION ON A SET 


In this section, we present a way in which a group with all its structure may interact 
with a set which has no structure whatsoever. This notion will lead us to many deep 
results in the theory of groups. 


Definition 4.1 Let (G,*) be a group and X any set. We say G acts on X if there 
is a binary operation - from G x X to X having the following properties: 


1. For allg,h € G and all x € X we have g-(h- x) =(g*h)-a. 
2. For alla € X we havel-xr=z. 


One also says that G defines a group action on X or that X is a G-set. 


Example 4.1 Here, we present several examples of group action. 


1. Let G= S, with * = o (composition) and X = {1,2,3,...,n}. We can let G 
act on X as follows: foro € G andi € X, define the group action 0-i = a(t). 
For instance, in S4, if o is a 90° rotation of a square, then o -2 = o(2) = 3. 
One needs to verify that we have indeed defined a group action. 


First take 0,7 € G andi € X. Then 


eel 4) Sor) = (6 os) G) = woz) <4. 
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Second, for the identity permutation t andi € X, we have 


$4 SA) Sk 


. Let G = GL,(R) and X = R”. Then G acts on X via matrix multiplication, 


i.e. for AG GL,(R) and v € R” (viewed as a column vector), then A-¥ = AU 
where the operation is matrix multiplication. The reader can easily verify that 
this indeed defines a group action. 


Let G= Si, and A = {a1,d2,...,Qn} be any set with n elements. Then for any 
k =1,2,3,... we can have G act on A® as follows: 


OF aa see y= (46(41)s @e(ix); a eens 


Remark 4.1 One may view a group action (and sometimes this is given as the 
definition) as a homomorphism 1: G + Sym(X). Indeed, given that G acts on X, 
then we can define 1(g) to be the permutation which sends each x to g- x. Likewise, 
given a homomorphism from G to Sym(X), we can define a group action as g- «= 


[7(g)] (x). 


Example 4.2 We now present four important group actions which we shall use in 
later discussions. The reader should verify that all four examples do indeed define 
group actions. 


iL, 


Let a group G act on itself (i.e. X = G) by left multiplication, i.e. fora € G 
andg € X define a-g =ag using the group operation. 


. Let a group G act on itself by conjugation, t.e. fora € G and g € X define 


1 


a-g=aga~ using the group operation. 


. Let G be a group and H <G. Set X = {gH : g ©€ G}, the collection of left 


cosets of G modulo H. Let G act on X_ by left multiplication, i.e. fora € G and 
gH € X define a: (gH) = (ag)H (note that one needs to verify that the action 
is well-defined). 


. Let G be a group and X be the collection of subgroups G. Let G act on X by 


conjugation, i.e. fora € Gand H € X define a- H = aHa™! (note that one 
should check that aHa~' < G). 


We now define several important structures associated with a group action. 


Definition 4.2 Let G be a group acting on a set X withge Gandxe X. 


1. 


The stabilizer of x, writtenG, ={gEG : gr= 2x}, ie. the group elements 
which fiz a particular x. 


The fixator of g, written X,={x EX : gu =a}, ie. the set elements fixed 
by a particular g. 
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3. The orbit of x, written Gx = {gx : g € G}, te. the elements of the set that 
can be realized by allowing all of the group to act on a fixed x. 


One can easily show that G, is a subgroup of G. Note that both X, and Gz are 
subsets of X. 


Example 4.3 Let us return now to our four important examples above and compute 
the structures we just defined. 


1. Gy is the trivial subgroup, Xg = 9 unless g = 1 in which case X; = G, and 
Gg =G. 


2. Gg = Calg) (recall the centralizer of g in G), and so is Xz = Ca(g), while 
Gg = 9° = {aga : a€ G} called the conjugacy class of g in G. 


3. Gg = gHg"' the collection of group elements which are conjugate to an ele- 
ment of H via the element g, and G(aH) = X. Note that in particular Gy = H. 


4. Gy are the set of all elements in G that satisfy the normal property for H. The 
notation for this set is No(H), i.e. 


Ne(H) ={g EG: gthg eH for alihe Hy}, 


and is called the normalizer of H in G. One can show that H < Nc(H) <G. 
The orbit GH = H° = {a-'Ha : a€ Gh. 


It is useful to note that orbits can be defined as the equivalence classes of a par- 
ticular equivalence relation on the set X the group G acts upon. Define the following 
relation on X: x ~ y iff there is a g € G such that gx = y. This defines an equiv- 
alence relation on X with equivalence classes being precisely the orbits of G acting 
on X. Indeed, ~ is reflexive since for any x € X we have lx = x (definition of group 
action). We have symmetry, since if x ~ y, then there is a g € G with gx = y, but 
then using the definition of group action this can be rewritten as g-!y = x and so 
y ~ x. For transitivity, if ¢ ~ y and y ~ z, then there are g,h € G with gx = y and 
hy = z. But then (hg)x = z using the definition of group action, and thus x ~ z. If 
we take any « € X and compute the equivalence class 


[t]={yeX : yrapa{yeXx : dgeG, y= gr} 
={gt : g€G}=Gzr. 


One use of this observation is the immediate result that any two orbits of a group 
action are either disjoint or coincide (since equivalence classes have this property). 
We now begin our discussion on counting results in this setting. 


Theorem 4.1 Let G be a group acting on a set X. Then for a given x € X there is 
a one-to-one correspondence between the elements in the orbit Ga and the cosets of 
G modulo G,. In particular, if G or X is finite, then |Gx| = |G: G,]. If both G and 
X are finite, then we have |G| = |G,||Gax| and so the size of an orbit divides the size 


of G. 
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Proof 4.1 We simply define a map f : G/G, > Gx by f(gGz) = gu. First note that 
f certainly maps onto Ga by its very definition. Second, f is both well-defined and 
one-to-one, since 


gGr =hGy iff h'ge€Ge iff h*gr=x iff gr=he iff f(gGe) = f(hGe). 
Thus, f is a bijection which proves the result. 
Corollary 4.1 Let G be a finite group with H < G and g € G, then 

1. The size of the conjugacy class of g equals [G : Ce(g)]. 


2. The number of the conjugacy classes of H equals |G: Ng(H)]. 


Proof 4.2 For the first statement, consider the group action in Example 4.2.2. The 
result follows immediately from Theorem 4.1 and our computations of Gx and Gy, in 
this setting. 

For the second statement, consider the group action in Example 4.2.4. The result 
follows immediately from Theorem 4.1 and our computations of Gx and G, in this 
setting. 


Example 4.4 Consider the cycle types (conjugacy classes) of elements in S4. 


CycleType Number 


Notice how the sizes of the conjugacy classes divide the order of the group Sa, 
since conjugacy classes are orbits. Let’s take it a step further. It’s easy to show (and 
is left as an exercise) that a subgroup of a group is normal iff the subgroup is a union 
of conjugacy classes. Equipped with this fact we can now show that the only subgroup 
of Sq of order 12 is Ag. 


Proposition 4.1 [f H < S, and |H| = 12, then H = Ag. 


Proof 4.3 First note that any subgroup of order 12 in Sq is normal, since it has 
index 2 in S4. Therefore, by Exercise 10, the subgroup must be a union of conjugacy 
classes. Of course, since it is a subgroup it must contain the singleton conjugacy class 
{1} which leaves 11 more elements to select. The sizes of the other four conjugacy 
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classes are 6, 3, 8 and 6. The only way to make eleven is using 8 and 8. Hence, there 
is only one way to make a subgroup of order 12, and so it must be Aq (note also that 
the cycle types in the conjugacy classes of size 8 and 8 are indeed even permutations). 


Example 4.5 Let G= S,, and rt = (1 2). We will count the number of conjugates of 
T in two different ways. 

First, we will count them directly. Note for any o € S, that oro~! = (a(1) o(2)) 
(see Exercise 9 in Section 2.4) and so the conjugates of T consist of all the transposi- 
tions in S,. The number of transpositions in S,, equals n(n—1)/2, since (k l) = (Lk). 
Therefore, the number of conjugates of T is n(n — 1)/2. 

Now we will count the number of conjugates of T using Corollary 4.1.1. We will 
have to count Cg(r) in order to do this. Note that ot = to iff oro |=7T iff 
(o(1) o(2) )=( 12) a ethero(1)=1 ond o(2)=2 oro) =2 and o(2)=1. 
Hence, |Ce(r)| = (n — 2)! + (n — 2)! = 2(n — 2)!. Now, by Corollary 4.1, the number 
of conjugates of T is 


IG] al n(n — 1) 
ot 


IG : Ce(7)] 


 (Ca(r)] An — 2)! | 

One final observation is that Example 4.2.1 and 3 do not yield much information 
when we apply Theorem 4.1 to each of them. For Example 1, we get |G| = [G : ]j 
which is trivial and Example 3 yields |G/H| =|G: H] which is simply the definition 
of index. 


EXERCISES 


1 Verify group action for Example 4.1, parts 2 and 3. 


2 Prove the alternate definition discussed in Remark 4.1 is equivalent to the one 
given in this text. 


3 Verify group action for each action defined in Example 4.2. 
4 For Example 4.2.3, verify that the action is well-defined. 
5 For Example 4.2.4, verify that aHa"! < G. 


6 Let G = Sy and set X equal to the set of all transpositions in $4. Set « = 
(2,4) € X. Let G act on X as follows: For o € G and (i,j) € X define 


o(i,9) = (a(4), 09) 

a. Verify that the above definition does indeed define a group action. 
b. List the elements of G, and thus compute |G,|. 

c. List the elements of Ga and thus compute |Gz|. 


d. Now compute |Gz| using the Proposition which states that |Gz| = [G : G,]. 


7 Verify for G acting on a set X and x € X that G, is a subgroup of G. 


14 


15 


16 
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Verify all the statements for each of the four examples given in Example 4.3 
regarding stabilizers, fixators and orbits. 


Prove that for H < G we have H < Ng(H) < G (there are two things to prove 
here). 


Prove that a subgroup of a group is normal iff the subgroup is a union of conju- 
gacy classes (note that one direction was proved in Exercise 15 of Section 2.9). 


Prove that an element g in a group G has a conjugacy class of size 1 iff g € Z(G). 
Prove that in an abelian group all conjugacy classes are of size 1. 


Give an example of two non-isomorphic groups of the same size with the same 
number of conjugacy classes of any given size. Must the groups be abelian for 
this to be possible? (explain) 


Consider a finite group G acting on a set X. 


a. For g€ Gand x € X show that Gyr = g ‘Gag. 

b. For g € G and z € X show that |G,| = |g-4Ggl. 

c. For x,y € X show that if Ga = Gy, then |G,| = |G,]|. 

Let G be a group and H < G. Consider the action of G on G/H by left 
multiplication (i.e. a(gH) = (ag)H). 

a. Express this action in terms of a homomorphism ¢ (see Remark 4.1). 

b. Show that kerd < H. 


c. Assume that [G : H] = 2. Apply the First Isomorphism Theorem on part a. 
to show that [G : ker@] < 2. 
[ 
[ 


d. Assume that [G : H] = 2. Use parts b. & c. to show that [G : ker@] = 2. 


e. Assume that [G : H] = 2. Use parts b. & d. to show that H = ker@. 
f. Using part e., Show that if H < Gand [G: H] = 2, then H is normal in G. 


A group G acts transitively on a set X if for any x,y € X, there existsag EG 
such that gx = y. A group G acts doubly transitive on a set X if for every 
L1,Y1, 2, Yo with x1 ~ y, and xy # yo, there exists a g € G such that gr, = x2 
and gyi = Ye. 

Show that G acts doubly transitively on a set X iff for any x € G, Gy, acts 
transitively on X — {x} and G acts transitively on X. 
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4.2 BURNSIDE’S LEMMA 


In this section, we introduce a result, called Burnside’s Lemma, which has surprising 
consequences in combinatorics allowing us to count various things with ease. Although 
its proof is quite straightforward, it yields a nice formula for counting orbits. Before 
we prove this result we need some more terminology. 


Definition 4.3 Let G be a group acting on a set X. The characteristic function 
associated with this action is a function f :G x X — {0,1} defined by 


1, gv=2 


f(g.) = 
0, gr #~x 


Lemma 4.1 (Burnside’s Lemma) /f G is a finite group acting on a finite set X, 
then the number of orbits of X equals 


ay Mil 


gEG 


Proof 4.4 Notice that 


Seoles: (= Hy = (= i, ») 


gEG gEG \rEx zEeX \geEG 
ao 1 
= >01G2| = ire eS eeerara 
xeX nex | tex [G2] 
Furthermore, consider a typical orbit Gx = {x1,...,x,} and notice that 


r 


=1. 
ica tea =D ia We ~ 1Ga] 
Therefore, 


S> |X,| = |G| (=< Ge aa) = |G| x (The number of orbits of X). 


gEG wEX 


Example 4.6 We now give some nice counting arguments which use Burnside’s 


Lemma. 


1. Consider the letters a,a,b,b,b,c,c and suppose we wish to count the number 
of distinct (nonsense) words we can produce using all seven letters (these are 
called arrangements). For instance, one such word could be acabcbb. Consider 
G = Sz acting on the set X of permutations — arrangements — of the seven 
letters in the natural way (i.e. for instance, if o is the 7-cycle( 1234567), 
then o(acabcbb) = bacabcb). Let x be the word aabbbcc. First note that the size 
of Gz equals 2!3!2!, since any o € Gz may permute the first two letters, the 
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second three letters and the last two letters of x and still fix x. Second, the size 
of Ga equals 


_ Gl. | 7 _ 
IG: Gel = 1a = agi = 2,3,2 = 210. 


Now, tt is intuitively clear that the orbit of x equals the entire set X and so the 
number of arrangements of a,a,b,b,b,c,c equals the size of this orbit, namely 
210. In other words, this group action has only one orbit, so we need not employ 
Burnside’s Lemma to show this fact. However, with a lot of patience one could 
actually compute the number of arrangements. Let us try this computation with 
a smaller example. 


We will count the number of arrangements of the letters a,a,b. In this case 
G = $3 which consists of 


1, (123), (13:2), (23) 1 3), 11 2), 
One can check that their corresponding fixators are 
Xi = X, X(1 23) = 9, Xi 3.2) =9, 
X23) = {baa}, Xa 3) = {aba}, X12) = {aab}. 
Therefore, since there ts one orbit, 


1 


1 
a] ol = lX1t0+0+1 +141) =1 


gEG 


Hence, |X| = 3, which is indeed the case, namely aab, aba and baa. 


Assuming we know there is one orbit, one can easily extend this argument to 
produce a general formula for counting arrangements, namely suppose we con- 
sider producing arrangements using the n letters 


Q1,41,---,41,42,02,...,42,.--,47,4r,...,4r, 
a — 
n, times no times n, times 


where n = ny + ng +---+n,. Then the number of distinct arrangements of 


those letters ts 
n n! 
11, 72,+-+5 Tp ny!ng!-++n,! 


. Consider a circle divided into 6 equal sectors as in Figure 4.1. Suppose we 
can color a sector either black or white and we wish to count the number of 
distinct ways of coloring the circle in the sense that two colorings of the circle 
are distinct if you cannot get from one to the other by rotating the circle. One 
says distinct up to rotation. To do this we label the sectors with numbers. 
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WAY 


Figure 4.1. Symmetries of the triangle. 


We consider the subgroup H of Se consisting of the six rotations of the circle 
0°, 60°, 120°, 180°, 240° and 300°. These elements of Sg are 


oo=1, oo =(123456), of=(135)(2 46), 


o3 = (1 4)(25)(36), o1=(153)(2264), of =(165432). 


Let X be the collection of all possible ways to color sectors 1,2,3,4,5 and6. The 
size of X is 2° = 64. Let H act on X in the natural way by rotating the circle the 
appropriate number of degrees, i.e. 0; will rotate the circle (60i)°. Notice that 
for a given coloring of the circle, the orbit of this coloring is precisely the set of 
colorings which are considered not distinct. Therefore, if we wish to count the 
number of distinct colorings of the circle, then we need only count the number 
of orbits of this action. Burnside’s Lemma does exactly this thing. We first need 
to compute the six fixators. Certainly, the fixator of of identity is the entire set 
X. The fizator of 0, consists of the circle with all white sectors and the circle 
with all black sectors. The fixator of a2 consists of the circle with all white 
sectors, the circle with all black sectors, the circle with even numbered sectors 
being black, and the circle with odd numbered sectors being black. The fixator of 
a3 consists of the circle with all white sectors, the circle with all black sectors, 
the three circles with two opposite sectors being black and the three circles with 
two opposite sectors being white. Finally Xo, = Xo, and Xz, = Xo,. Therefore, 
using Burnside’s Lemma, the number of orbits equals 
1 my 


iA (|Xoo| + |Xo,| + |Xo2| + |Xo3| + |Xo,| + |Xo5|) = g Otte tt 84442) = 1. 


The reader may wish to list the 14 distinct colorings. 


8. Consider the same setup as Example 2, but in addition assume that the coloring 
of a circle shows through to the back of the circle. If we wish to count the number 
of distinct colorings this means we are counting the number of distinct colorings 
up to rotation and reflection. There are exactly six rigid reflections of the circle, 
namely the reflections across the three diameters marked on the circle and the 
reflections across the three diameters which bisect opposite sectors. Name these 
six reflections to be o6,...,01, and note that Dg = {00,01,.--, 711}. The reader 
should check that the first three reflections have fixators of size 8 while the second 
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three have fixators of size 16. Therefore, using Burnside’s Lemma, the number 
of orbits equals 


TP 3 Xo; 


Perhaps surprising, this additional restriction on distinctness barely reduces 
their number. The reader should look at the list of distinct colorings compiled 
in Example 2 and decide which two of the 14 colorings are now being equated. 


= a(O4+2444844424848484 164 16+ 16) = 13. 


EXERCISES 


1 List the 14 distinct colorings in Example 4.6.2 
2 Compute by hand the additional fixators in Example 4.6.3. 


3 Which two of the 14 colorings in Example 4.6.2 are now being equated in Ex- 
ample 4.6.3? 


4 Count the number of unique dominos first using a simple combinatorial argu- 
ment, then by using group action. 


4.3 POLYA’S FORMULA 


We first define Polya’s Formula and then we will apply it to counting distinct color- 
ings. It arises from Burnside’s Lemma for counting orbits. Polya’s Formula is derived 
from the cycle types in the group acting on the colorings. Each cycle type found in 
the group corresponds to a monomial in Polya’s Formula. 


Example 4.7 If a permutation in Siz has cycle type (*)(*) (2) () (#*) (> * &) the 
2 


corresponding monomial in Polya’s Formula is xtx3xq4. 

In general, if a permutation has cycle type which includes n cycles each of length 
m, then x, is included in the monomial in Polya’s Formula corresponding to that 
cycle type. The coefficient of this monomial will be the number of permutations in 
the group acting on the colorings of that particular cycle type. 


Definition 4.4 Let G be a subgroup of S, acting on a set of colorings. Polya’s For- 
mula is a polynomial in unknowns x1, %2,...,%n of the form 


€1 _,€2 

3S Ly Xo eee 
ai ocG 

where in the formula above, the a has e; cycles of length i in its cycle type (for 


i=1,2,...,n). 


Example 4.8 Consider the example from the previous section for coloring the circle 
with siz sectors. 
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1. If our group is the siz rotations, then the cycle decompositions, cycle types and 


Polya monomials are 


listed below: 


Permutation Cyclic Cycle Type Polya 
Decomposition Monomial 

0° (1)(2)(3)(4)(5) (6) | (*) 4) OH) 4) (4) xy 

60° (123456) (eae ee) L6 
120° (1.3 5) 46) (« & &)(* & *) oe 
180° (1 4)(2 5)(3 6) (se) (4k) (4) is 
240° (1 5 3)(2 6 4) (« & &)(* & *) ic 
300° (165483 2) (2 KK 2k) XG 


Therefore, Polya’s Formula is 


1 
P(x, £2, 23, £6) = —( 


2. If our group is the six rotations and six reflections, then we add the following 


siz rows to the table: 


6 


x + 2x6 + 2x3 + 73) 


Permutation Cyclic Cycle Type | Polya 
Decomposition Monomial 

I (1)(4)(2 6)(3 5) | ()(«)(#*)(**) | ae? 
p12 (2)(5)(1 3)(4 6) | (%)(«)(#*)(**) | ae? 
1s (3)(6)(2 4)(1.5) | (e)(«)(#*)(**) | a2? 
7 (12)(3.6)(4.5) | (ex)(ee)(ex) | 28 
Is (2 3)(14)(5 6) | (ex)(ex)(ex) | 2 
1 (3.4)(25)(1.6) | (ex)(ee)(ex) | 28 


Therefore, Polya’s Formula is 


1 
P(21, £2, 23, £6) = —( 


12 


x? + ag + 2x3 + 403 + 30723). 
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As you probably noticed, Polya’s Formula looks very similar to Burnside’s Lemma 
for counting orbits. In fact, each monomial corresponds to a fixator. Polya’s Formula 
has two primary uses: To count the number of distinct colorings and to produce 
the inventory of unique colorings. First, to count the number of distinct colorings 
simply evaluate all the x; by the number of colors used. This makes sense since all 
the numbers in a cycle must be colored with the same color to remain fixed, thus 
there are as many ways to color the numbers in that cycle as there are colors. 


Example 4.9 Consider again Example 4.8. 


1. If we are looking for the distinct colorings up to rotation, we have seen that 
Polya’s Formula is P(x1,%2, 23,26) = #(x§ + 2x6 + 2x3 +23). So the number of 
distinct colorings with two colors is P(2,2,2,2) = $(2°+2-2+2-27+23) = 14, 
which got us to our answer much quicker than Burnside’s Lemma. In fact, now 
we can easily compute distinct colorings with three colors to be P(3,3,3,3) = 
4(3°+2-34 2-3? +39) = 130. 


2. If we are looking for the distinct colorings up to rotation and reflection, we have 
seen that Polya’s Formula is P(x1, £2, 3,6) = 7y(@$+2xg+273+403432} 23). 
So the number of distinct colorings with two colors is P(2,2,2,2) = 4 (28 +2: 
242-2744.23+43-2?.27) = 13. Again, we can easily compute distinct colorings 
with three colors to be P(3,3,3,3) = 4 (35+2-34+2-3?+4-334+3-32-37) = 92. 


Now let’s address the second use of Polya’s Formula, namely to produce the in- 
ventory of all distinct colorings. Polya’s Formula will be used as a generating function 
to list all the possible colorings. How it works is as follows: Let P(x, 22,...,%n) be 
Polya’s Formula — a multi-variate polynomial in the unknowns 21, %2,...,%n. Suppose 
our colors are C1,C1,.-.,Cm- If we evaluate each x, at 37%, c¥, we will get a poly- 
nomial in c1,¢€2,...,;¢m which will describe explicitly the full inventory of distinct 
colorings. Indeed, replacing x, by 37”, cf is saying we must color all the elements in 
a k-cycle the same color, having m colors to choose from. Having done so, the coef- 


: e1 €2 Cys m m 2 m n 
ficient of cy'cs? --- ch in POE, ci, 21 G,--- 721 @) corresponds to the number 
of ways to color using e; colors of c,, e2 colors of cg, ..., Em colors of Cm. 


Example 4.10 Let’s return to Example 4.8. 


1. For distinct colorings of the six sectors up to rotation, we derived Polya’s For- 
mula P(x1, 22, 23,26) = a(x$ + 2a + 2x2 + x3). Suppose first we are coloring 
using black and white. To get the full inventory of colorings we evaluate 


1 
P(b+w, b+”, b?+w3, &+w®) = 5 ((b+w)?+2(b° +?) +2(b+w")"+(b°+w")") 


= w® + bw® + 3b2w* + 4b3w? + 3b4w? + d>w + BS. 


What this is telling us is that there is one way to color them all white, one way 
to color using one black and five white, three ways to color using two black and 
four white, etc. Notice we still have to decide what the colorings are, but at least 
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we know how many we are looking for of each type. Notice also that if we set 
b=1 and w =1 in the resulting polynomial, we once again get the number of 
distinct colorings. Using three colors b,w,r we begin to see the true power of 
Polya’s Formula: 


= -((b+w+r)® +2(b° + w® + 7°) + 2(08 + w? + 3)? + (+0? +1°)*) 


=wtrw?+bw? +3r2wt + 5sbrw* +30? wt + 4r3.w? + 10br2w? + 100? rw? + 4b? w? 
+3r4w? + 10br?2w? + 160772 w? + 100? rw? + 3b4w? + rw + 5br4w + 10027? w 
+10b°r2w + 5b*rw + Dw + r® + br® + 3b?r4 + 437? + 3b4r? + ber + BS. 


2. Let’s repeat the process for colorings distinct up to rotation and reflection. We 
derived Polya’s Formula in this case to be 


1 
PGi; fo, 03.06) = 7p ("1 + 296 + 202 + 4x3 + 32723). 


Thus, for two colors we evaluate 


P(b+w,b? + w?, 0? + w?, b® + w®) 


1 
= S((b+ w)® + 2(0° + w8) +2003 + w9? +4? + 0)? + 366+ Ww)? + w)?) 


= w® + bw? + 3b?w* + 3b3w? + 3b*w? + b>w + B8. 
Comparing the inventory in the last example, focusing on the coefficients of 
bw? we see that it is here that the number of colorings was reduced by one. 
Let’s try three colors: 
Pibtwt+r,b? +w?+r?,b? + w? + r?, 0° + w® +r) 
=, 
= 19 
t4(b? + w* + 77)? + 3(b+ w+ r)?(b? + w? + 17)?) 


= w'+rw°? + bw? + 3r2wt + 3brw* + 3b2w* + 38r3w? + brew? + 6b2rw? + 3b3w? 


((b+ w+r)® + 2(b° + w® + r®) + 2(0? + w? + r)? 


43r4w? + 6brow? + 11b2r2w? + 6b? rw? + 3b4w? + Pew + 3br4+w + 6b?r2w 


+6b?r?w + 3b'rw + bw + 7° + br? + 30?r4 + 3b8r? + 3b*r? + br + OF. 


Example 4.11 This example deals with counting distinct graphs. If two vertices in 
the graph are connected by an edge, we will consider that edge colored black. If they 
are not connected by an edge, we will consider that (nonexistent) edge painted white. 
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Let’s count the number of graphs with five vertices. Before we can do this we need to 
count the number of edges it can have, but this is not difficult. 


The number of edges will be ( ; = 10. 


Therefore, the number of graphs possible is 2!° = 1024. In general, 


n 
the number of graphs with n vertices is i z . 


Now some of these graphs with five vertices are not distinct. For instance, the 
graph containing only the edge connecting vertices 1 and 2 is the same as the graph 
containing only the edge connecting vertices 3 and 4. What we really want to count is 
the number of distinct (or in this case we say non-isomorphic) graphs. What then 
do we mean by distinct in this case? What we mean is there is no permutation of the 
vertices which preserves all the edge connections, i.e. fixes the coloration. So it boils 
down to counting something we already know how to do. 

Let’s start with an easier set of graphs, namely one with four vertices. So our 
set X is the collection of all graphs with four vertices (there are 2° = 64). Now the 
entire permutation group G = S4 is acting on the set vertices of the graphs in X. We 
require a table of all the possible cycle types in S4 and the number of each type (we 
did a similar thing in Section 8.4 using Ss). 


Cycle Type | Number 


(2) (+) x5 5 = 3 
(x > 2) a =6 


Since X is really the collection of colorations of edges, we need to find the correspond- 
ing permutation of the edges in order to create Polya’s Formula. We shall denote the 
collection of edges by 12,13, 14,23, 24,34, where mm means the edge connecting the 
vertices numbered m and n. Let’s add one more columns to our table using a generic 
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permutation of each vertice cycle type. 


Vertice Cycle Type | Number | Edge Cycle Type 
(1)(2)(3)(4) 1 (12) (13) (14) (23) (24) (34) 
(1)(2)(3 4) 6 (12)(13 14) (23 24) (34) 

(1)(2 3 4) 8 (12 T3 14)(23 24 34) 
(1 2)(3 4) 3 (12)(T3 24)(T4 23) (34) 
(1234) 6 (12 23 34 14)(13 24) 


Therefore, Polya’s Formula is 


1 
5g (Tl + Oxx3 + 8x3 + 6r2x4), 


and the number of distinct graphs with four vertices is 


1 
pg (2 + 9:2" 2 +8-2°+6-2-2) = 11. 


We can exhibit the inventory of distinct graphs with four vertices by evaluating 
1 
yas w)® + 9(b + w)?(b? + w*)? + 8(b? + w)? + 6(b? + w?) (bt + w)), 
but since we only care about existing edges we can replace w by 1 and evaluate 


1 
rvs 1) + 9(5 + 1)7(b? + 17)? + 8(b? + 17)? + 6(6? + 17)(b* + 1*)) 
[p+ +3 4 2b +. 


In Figure 4.2, we display the distinct graphs. 


EXERCISES 


1 Investigate the case of graphs with five vertices as we did with four in Exam- 
ple 4.11. 


2 We wish to paint the roof of a house (see Figure 4.3). There are four sections of 
roof each of which can be painted in one of two colors: sienna and ochre. Our 
goal is to investigate the number of distinct colorings up to (two) rotations and 
(two) reflections. 


a. Use Burnside’s Lemma to count the number of distinct colorings. 
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Figure 4.2 The distinct graphs with four vertices. 


Figure 4.3. The roof of a house. 


b. Create the Polya Polynomial associated with this problem. 
c. Use part b. to count the number of distinct colorings. 
d. Use part b. to list the inventory of colorings. 


e. Exhibit an example of each of the distinct colorings. 


4.4 SOME CONSEQUENCES OF GROUP ACTION 


In this section, we prove some important results in group theory using group action. 
First, we re-prove Cayley’s Theorem using group action. Notice below how easily 
the result falls out using group action. 


Theorem 4.2 (Cayley’s Theorem) /f |G| = n < ov, then G is isomorphic to a 
subgroup of Sy. 


Proof 4.5 Let G act on G by left multiplication. This induces a homomorphism 
17: G— Sym(G). The kernel of this map is trivial, since if for some a € G we have 
n(a) = 1, then [r(a)|(g) = g for all g € G. In particular, [x(a)|(a) = a or a? = a, 
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which implies a=1. Thus by FTH, 
G2=G/1=G/kera = 7(G) < Sym(G) & Sp. 
Thus, G is isomorphic to a subgroup of Sp. 
The next result generalizes the result that a subgroup of index 2 must be normal. 


Theorem 4.3 (Ore’s Theorem) Let H < G a group. If |G: H| = p and p is the 
smallest prime dividing the order of G, then HAG. 


Proof 4.6 Let G act on G/H by left multiplication. This group action gives rise to a 
homomorphism 7: G+ Sym(G/H). The kernel of this map is contained in H, since 
if ™(g) =1, then in particular, |x(g)|(H) = H or gH =H which implies g €¢ H. We 
show, in fact, that ker = H (and so HAG). Suppose to the contrary that ker x # H. 
By FTH, G/kern = n(G) < Sym(G/H) = S, and so [G : kerm] = |7(G)| which 
divides |Sym(G/H)| = |S,| = p!. We also know that 


[G : kera] =[G: H][H : kera] = p[H: kerz], 


and so p divides |G : ker 1]. Write p! = [|G : ker ]m for some integer m, then multiply 
m on both sides of |G : ker 7] = p[H : ker a] to get p! = [G: kera]m = p[H : ker a]m. 
Cancelling a p on both sides of the last equation gives that |H : kera] 4 1 divides 
(p — 1)!. Therefore, there exists a prime q < p which divides |H : kerz], and thus 
divides |H|, and thus divides |G|, which is a contradiction. 


Remark 4.2 We point out that in general if H < G and |G: H] =p a prime, then 
it need not be the case that H AG. Consider the group G = S3 and H = (1) where 
tt = (2 3). Notice that [|G : H] = |G|/|H| =6/2 =3 a prime, yet H is not normal in 
G. Indeed, take p = (1 2 3) € G and compute 


p typ = (32 1)(23)(123) =(12) ¢ H. 


Our goal now is to derive using group action an important combinatorial group 
equation called the class equation. 


Theorem 4.4 Let G be any finite group. Then 
IG] = |Z(@)| + DIG : Ca(gi)], 
i=1 


for some natural number m and some g; € G with |G: Ce(gi)| > 1. Note that m = 0 
exactly when G is abelian. 


Proof 4.7 Let G act on itself by conjugation as in Example 4.2.2. Since orbits are 
equivalence classes on the set G and G is finite, we have 


G= |_| Goi, 


i=1 
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for some g; € G and some positive integer r. Now some of these orbits Gg; may 
contain only a single element and we’ve seen that |Gg;|=1 iff g, € Z(G). Therefore, 
all the singleton orbits of this action may be combined to obtain Z(G). If G is abelian, 
then all the orbits are singletons and G = Z(G). Otherwise, write the union as 


G=Z(G)u | Gigi. 
i=1 
for some g; € G with |Gg;| > 1 and some positive integer m. This yields the equation 
|= 12@)1+ SIG = 12G)1+ DIG = Cola 
for some g; € G with |G: Ce(gi)| > 1 and some positive integer m. 


Example 4.12 We will illustrate the class equation with an example. Consider the 
dihedral group D4 = {1, pi, P2, P3, 11; 12, M3, Ma}, where 


pr=(1234), pr=(13)(24), p= (1432), 
Let G act on G by conjugation. We first compute all the distinct orbits of this 
action (recall that orbits partition the set being acting upon into disjoint equivalence 


classes). The orbit of 1 is always {1}, since for all g © G we have glg-! = 1. We 
now compute the orbit of p1: 


{1p11~", pipipy*, 020102 ', P3P1P3 > HaPiley s M2Pily', Mapi bs, Mapipg '} 


= {p1, p3}- 
The orbit of pz ts 


{1p21-', pipapy, P2P2P3°; P302P3 Hi Pally” M2P2L3 ', H3P2/l3 |, Map2itg } 
= {pr}. 
The orbit of py ts 
{1ui17", pimp’, poeips', p3lip3, Mieaoy |, Mapa ply, Mapes |, Mapa peg} 
_ tis [u3}. 
Finally, the orbit of pz is 
{1 p21", prop,’ P2b2Py’, P3l2P3 Hi Mely |, MefleHa’, M3H2l3 , Malletg } 
= {/l2, ja}. 
Hence, the partition of D4 into disjoint orbits is 
Da = {1} U {p2} U {e1, p3} U {er wg} U {p2, pa}. 


As in the proof of the Class Equation, the singleton orbits combine to form the 
center of the group and so we see that Z(D4) = {1, po}. 
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We now give several consequences which can be proved using the class equation. 


Corollary 4.2 If G is a group with |G| = p” for some prime p and positive integer 
n, then p divides Z(G), so in particular Z(G) #1. 


Proof 4.8 By the class equation, we have 


p” =|Z(@)| + 1G: Calg) 


i=l 


for some g; € G with |G : Ce(gi)] > 1 and some positive integer m. Note that if 
the sum were empty, then G = Z(G) and the result follows trivially. So assume now 
that the sum has terms. Since for each i, |G| = |G : Ce(gi)||Ca(g:)| this implies 
that |G : Ce(gi)| divides |G|. Now |G| = p” and |G : Ce(gi)| > 1, so it must be that 
IG : Cg(g;)| = p® for some 1<k <n. Regardless of what k is, we can conclude that 
each |G : Ce(gi)| is a multiple of p and therefore \7\",|G : Ca(g:)| is a multiple of 
p. So we can rewrite the class equation as p” = |Z(G)| + Lp, for some integer L. 
Therefore, |Z(G)| = p(p"-' — L) ande we can conclude that p divides |Z(G)|. 


Corollary 4.3 If G is a group with |G| = p? for some prime p, then G is abelian. 


Proof 4.9 By Corollary 4.2, we have p divides |Z(G)| and since |Z(G)| divides |G| 
it must be that either |Z(G)| = p or p*. But |Z(G)| cannot be p, for if it were, then we 
would have |G/Z(G)| = |G|/|Z(G)| = p?/p = p. This would imply that G/Z(G) = Z, 
a cyclic group which by Lemma 2.12 implies that G is abelian, contradicting the fact 
that Z(G) 4 G. Hence, |Z(G)| must be p? and so G must be abelian. 


We point out that there is another way to obtain this last result without the use 
of the class equation (which we will see later). We now prove Cauchy’s Lemma using 
the class equation. 


Theorem 4.5 (Cauchy’s Theorem) [f G is any finite group whose order is divis- 
ible by a prime p, then G has a element of order p. 


Proof 4.10 By induction, we assume all groups of smaller size than G have the 
desired property of this theorem (note that the lemma holds vacuously for the trivial 
group). If G is abelian, then we are done by Lemma 2.14. So we may assume G is 
nonabelian. By the Class Equation, we know 


IG] = |2(@)| + NIG: Calg) 


i=l 


for some positive integer m and some g; € G with |G: Ce(gi)| > 1. Note that the sum 
is not empty, since G is assumed not to be abelian. Note also that for each g; we have 
IG] = [G : Ce(g)||Ce(gi)| and since |G : Ce(gi)| > 1, this makes Ce(g;) a proper 
subgroup of G. If p were to divide the order of one of the Ce(g;), then by induction 
such a Ce(g;i), and therefore G, would have an element of order p thus proving the 
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theorem. So we may assume that for all the g; it is not the case that p divides the 
order of Ce(g:). Since p is prime and |G| = |G : Ce(g:)||Ce(gi)|, it must then be true 
then that for all the g; it is the case that p divides |G : Ca(g:)|. Therefore, p divides 


the sum 
m 


DIG : Ca(w)]. 
i=1 
Since p divides this sum and also divides |G|, by the class equation it must be that 
p divides |Z(G)|. Since G is assumed to be nonabelian, this makes Z(G) a proper 
subgroup of G and so by induction has (and therefore G has) an element of order p. 


There are other ways to obtain Cauchy’s Lemma one of which we will present 
later which uses the Sylow Theorems. With the use of the lemma we prove next, we 
can give an alternative proof of Ore’s Theorem. 


Lemma 4.2 Let G act on a set X and @ be the homomorphism associated with that 
group action. Let Y C X be a set of representatives for the distinct orbits of the group 
action and H =()\pey Gx, then 


ker(o) = () Cy () He <A. 


rex gEG 


Proof 4.11 The results are all contained in the following equivalent statements: k € 
ker(o) iff (kK) =1x if o(k)\(z)=acallanex if k-c=acallctex if keG, 
alxeX iff ke Gy, alge Gandally€Y iff k€ gGyg"' allg €G and all 
yeY if Ke lMgeqg 19. 

Certainly, \geq H9 < A, since H = H!. 


Corollary 4.4 If H < G a group with |G: H| =n, then there exists K AG with 
K < H such that |G: Kk] divides nl. 


Proof 4.12 Let G act on G/H by left multiplication and @ be the associated homo- 
morphism for the group action. Set kK = ker(@) and we shall show that K is the 
desired normal subgroup we seek. First, by Lemma 4.2, 


k=) Gor © Grow: 
gHEG/H 


Second, as in the proof of Cayley’s Theorem, ¢(G) is isomorphic to a subgroup of 
S, which implies |6(G)| divides |S,,| = n!. By the Fundamental Theorem of Homo- 
morphisms, G/K ~ ¢(G) so that |G: Kk] =|G/K| =|¢(G)| which divides nl. 


Theorem 4.6 (Ore’s Theorem (revisited)) Jf G is a finite group and H <G 
with |G : H| =p where p is the smallest prime dividing |G|, then HAG. 


Proof 4.13 By Corollary 4.4, there isa K < H with KG and |G: K] divides p!. 
Setm = |G: K] and note that m divides |G|. Notice that for any prime q dividing m 
we have q divides |G| and q divides p!. But then, since p is the smallest prime dividing 
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|G|, it must be that q = p. Therefore, m must be a power of p, saym = p*. Now since 
p* divides p! it must be that k =1 and som=p. Thus, [G: K] =p=[G: H] and 


= NW es, Se. 
ea Gamay 


Since K < H and |K| =|H| we have H= K AG. 


EXERCISES 


1 As in Example 4.12, determine the class equation for each of the following 
groups: 


a. Qs 
b. Ds 


2 Let G be a group with |G| = p” for p prime and n a positive integer. 


a. Show that if H < Gand |H| =p""!, then H dG. 


b. Show that if H < G and H # G, then there exists g € G H such that 
gig (=H. 


4.5 SYLOW THEORY 


For the purposes of this section we shall always assume that G is a finite group. 
Sylow Theory is an extremely important component in the study and classification 
of finite groups. There are basically three main results in this area called (naturally) 
the First, Second and Third Sylow Theorem. Before we can get to these results we 
need to define terms and prove a few preliminary results. 


Definition 4.5 For a given prime p and H a subgroup of a group G, 


1. H is a p-subgroup if |H| = p* for some natural number k. 
2. H is a p-Sylow subgroup if |H| = p” for some natural number n and p"*+ 


does not divide |G|. In other words, H is a maximal p-subgroup. 
3. G is a p-group if |G| = p” for some natural number n. 


We will now derive a more general class equation for use in the Sylow Theorems 
and as a way to give an alternate and quite elegant proof of Cauchy’s Theorem. 


Theorem 4.7 (General Class Equation) If G acts on a finite set X, then 


|X| =|¥|+ SO[G: Ga], 
i=l 
where [G : Gz,| > 1 for some x; € X and positive integer m and Y = {x © X 
|Gx| = 1} (again, there may be no terms in the sum). If, in addition, G is a p-group, 
then |X| = |Y| (mod p). 
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Proof 4.14 The derivation of this more general class equation is identical to the 
work done to derive the original class equation, so there is no need to reproduce that 
argument here. We will verify the statement that |X| = |Y| (mod p) in the case 
that G is a p-group. Considering the general class equation, if there were no terms 
in the sum, then the statement follows trivially. Otherwise, notice that for each i, 
IG| = [|G : G,,]|Gz,|. Set |G| = p” so that |G: G,,] divides p” which implies that 
p must divide [|G : Gz,| > 1 for all i. Hence, p divides the sum, 7",(G : Go,], 
which by the general class equation implies that p divides the difference |X|—|Y |, i.e. 
IX] = [Y| (mod p). 


Theorem 4.8 (Cauchy’s Theorem) I[f G is a group whose order is divisible by a 
prime p, then G must have a element of order p. 


Proof 4.15 Let X be the set consisting of p-tuples (91, 92,---;9p) of elements of G 
with the property that 9192--- Gp = 1. Note that X is non-empty, since (1,1,...,1) € 
X. In fact, the size of X must be |G|?~! since we can choose the first p—1 components 
of an p-tuples freely in G with the last being determined as (g192---Gp-1)~'. Note 
also that the size of X implies in particular that p divides |X|. Take the p-cycle 
o=(12--- p) € S, and let the subgroup (a) act on X by permuting the subscripts 
of any p-tuple. In other words, 


o* (91, 92, ies Ip) a (Gok (1)s Jok(2)s as Geriey)s 


We leave it to the reader to check (by induction on k) that o*(g1, 92,---;9p) € X 
and that we indeed have a group action on a set. By Theorem 4.7, since (a) is a p- 
group we have that |X| = |Y|(mod p). Thus, since p divides |X|, it follows that p also 
divides |Y |. Let us look more closely at the set Y. Recall, that Y consists of elements 
of X which have orbits each consisting of one element. Note that Y is non-empty, 
since (1,1,...,1) © Y. Take one such (g1, g2,---,9p) © Y. Then 


(0) (91, 92)+++5 9p) = {(91, 92,--+5 9p) fs 


so in particular, O(91;925--++9p) = (91; 92)+++59p) or (GetieGetds=!-49e(p)) = 


(91; 921--+19p) OT (92,93,---1 9p: 91) = (91; 92,--+, 9p). But then 91 = g2 =--- = Gp. 
Recall the property of elements of X, that gig2--: Gp = 1 and this yields if we set 
G= 7 = 92 =°+** = Y that g? = 1. Finally, note that since p divides |Y| we know 


we can find a g #1 with this property g? =1 and so we have proved the theorem. 
We are now ready to prove the three Sylow Theorems. 


Theorem 4.9 (First Sylow Theorem) Let G be a finite subgroup such that p* 
divides |G| with k > 0. Then G has a subgroup of order p*. In particular, G has a 
p-Sylow subgroup. 


Proof 4.16 Let G be a group of order p"r where p is a prime, n is a natural number 
and p does not divide r. The proof is by induction on |G|. When |G| = 1, the result 
holds trivially. By induction, assume that all groups of order less than the order of 
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G satisfy the theorem. If G had a proper subgroup H of order p"s, then by induction 
H (and therefore G) would have a p-subgroup of order p* for 0 <k <n. In the case 
that no such subgroup H exists, we consider the class equation, 


IG] = |2(@)| + NIG: Calg) 


i=l 


for some positive integer m and some g; € G with |G: Cg(gi)] > 1. Furthermore, the 
subgroups Ce(gi) must have order p™r; with ni <n and p does not divide r;. Notice 
that for alli 


pr = |G| = [G: Ce(9i)||Ca(g)| = [G : Ca(gi)|p™ 7. 


Therefore, it must be the case that p divides |G : Ce(gi)| for alli and so by the 
class equation p divides |Z(G)|. By Cauchy’s Theorem, Z(G) has an element, say z, 
of order p. Set H = (z) < Z(G), and thus HAG. Then G/H is a group of order 
p"—'r less than the order of G and so has a p-subgroup K/H < G/H of order p* for 
0<k<n-1, where H < K <G (by the Correspondence Theorem). Since |H| = p 
and |H| divides |K|, this imples that K is a p-subgroup of G of order p**t! where 
1<k4+1<n. The only p-subgroup we have missed is the trivial one, which certainly 
exists. 


Theorem 4.10 (Second Sylow Theorem) Any p-subgroup of a group G can be 
conjugated into any p-Sylow of G, i.e. if H is a p-subgroup of G and P is a p-Sylow 
of G, then there exist g € G such that gHg"' < P. 


Proof 4.17 Let H act on X = G/P by left multiplication. As in the general class 
equation, let Y consist of those elements of X with one element orbits. First note 
that Y is non-empty. Indeed, Since |G/P| is not divisible by p (otherwise P is not 
a p-Sylow) and |X| = |Y| (mod p) this implies that p does not divide |Y| and so 
in particular, |Y| 4 0. So take any gP € Y and note that for all h € H we have 
hgP = gP or equivalently ghg~' € P, and so gHg"! < P. 


Remark 4.3 We make several remarks about conjugation. 


1. Note that if P is a p-Sylow subgroup of G and g € G, then gPg7' is also a p- 
Sylow subgroup of G. To see this simply consider the bijection f : P > gPg~* 
by f(h) = ghg-'. More generally, if H < G, then for any g € G we have 
gHg"' < G and |gHg"'| = |H|. 


2. We remind the reader of some exponential notation introduced for group ele- 
ments. For g,h € G a group, we write hI = ghg~'. Similarly, for H < G, we 
write H9 = gHg"!. 


Corollary 4.5 For any finite group G 


1. Every p-subgroup of G is contained in a p-Sylow of G. 
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2. Any two p-Sylows of G are conjugate in G. Thus, for any p-Sylow subgroup of 
G the set {gPg-' : g € G} consists of all the p-Sylow subgroups of G. 


&. If the number of p-Sylow subgroups in G is equal to 1, then this unique subgroup 
is normal in G. 


4. The number of p-Sylow subgroups in G equals |G : Ng(P)], where P is any 
p-Sylow subgroup of G. 


Proof 4.18 The first part is easy, for if H is a p-subgroup of G, then by the Second 
Sylow Theorem, there is ag € G such that H9 < P where P is a p-Sylow of G. But 
then H < P9' a conjugate of a p-Sylow which is again a p-Sylow (see Remark 4.38). 

The second part is also easy. Take two p-Sylows of G, say P and Q. By the Second 
Sylow Theorem, there is ag € G such that P¥ < Q. But as we pointed out P9 is a 
p-Sylow as well which means |P9| = |Q| and so PI =Q. 

To prove the third part, let P be the unique p-Sylow subgroup. For any g € G, since 
g'Pg is also a p-Sylow subgroup and there is only one, it must be that g~'Pg = P 
which proves that PIG. 

To prove the fourth part, by Corollary 4.1 and Corollary 4.5.ii, the number of 
p-Sylows in G equals |G: Ng(P)] for any p-Sylow P of G. 


Definition 4.6 For any prime p, let n, denote the number of p-Sylow subgroups of 
a given group G. 


Theorem 4.11 (Third Sylow Theorem) /f|G| = p"m with p not dividing m and 
n> 0, then np divides m and is congruent to 1 modulo p. 


Proof 4.19 By the First Sylow Theorem, there is a p-Sylow P < G. Let X be the 
set of all p-Sylows of G and have P act on X by conjugation. This action makes 
sense, since every conjugate of a p-Sylow of G is again a p-Sylow of G. As usual, 
Y will denote the elements of X which have single element orbits. By Theorem 4.7, 
|X| = |Y| (mod p). We will show that Y = {P} so that |Y| = 1 and thus prove half 
of the result. 

First note that P € Y since certainly g°-'Pg = P for any g € P. Now suppose 
that Q € Y. Then g-'Qg = Q all g € P and so P < N@(Q). Since P < Ne(Q) 
and |P| =p", then p” divides |Ng(Q)| so that |Ne(Q)| = p"k with p not dividing k. 
This means that P is a p-Sylow of Ng(Q) as well. Using the same argument, Q is 
a p-Sylow of Ng(Q). By Corollary 4.5, there is a g € N@(Q) such that g-'Qg = P. 
But, since g € Na(Q), we have P=g-!Qg=Q. 

Finally, since P < Ng(P), as in the above argument, |Ng(P)| = p"k with p not 
dividing k. So then 


p'm = |G| = [G : No(P)I|No(P)| = [G : No(P)p"k, 


which implies that m = |G: N@(P)|k and so |G: Ng(P)|, which by Corollary 4.5.4 
is the number of p-Sylows of G, must divide m. 
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EXERCISES 


1 Verify the statement in Theorem 4.8 that 


a. The action defined in the proof is indeed a group action. 


b. o* (91, 92,---;9p) € X for k > 0. 
2 Prove the following exponential laws for g,h,k € G a group: 


a. A9Ik9 = (hk)9. 
b. (h*)9 — fy(kg). 
c. Al’) = (h9) 9"), 


3 Prove that any subgroup of Z(G) must be normal. 


4.6 CLASSIFYING FINITE GROUPS WITH SYLOW THEORY 


We start this section with a general result that goes very far in classifying finite 
groups. We first need to prove a couple of number theoretic results. 


Lemma 4.3 Given p is a prime number and m is any positive integer such that m 
divides p — 1. 


1. The congruence equation x™ = 1 (mod p) has m distinct solutions modulo p. 


2. Ifa is a solution to x™ = 1 (mod p) and k is the smallest positive integer such 
that x* = 1 (mod p), then k divides m. 


Proof 4.20 Without diving prematurely into ring theory (all of this is proved later in 
the text), one can show given a polynomial f(x) of degree m with integer coefficients 
that the congruence equation f(x) = 0 (mod p) has at most m distinct solutions in 
Z,. In fact, there is a criterion for when such an f(x) has exactly m solutions — it 
occurs iff f(a) divides x? — x. Therefore, if m divides p—1, then for some integer 
k we have 


oti ya g™ — 1 = (2™—1)(ce™ D4... 40™41), 


and so x™ —1 divides x”~!—1 which in turn divides x? —x and so the first statement 
is proved. 

To prove the second statement, notice that if a is a solution to x™ = 1 (mod p), 
then a™ = 1 in the multiplicative group Z. To say k is the smallest positive integer 
such that «* = 1 (mod p) is to say that o(a) = k in Zi. Therefore, by Lemma 2.2.1, 
we know that k must necessarily divide m. 
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Theorem 4.12 Let G be a group with |G| = pq for primes p and q. 


1. Ifp=q, then G is an abelian group and there are only two such groups (up to 
isomorphism), namely Z,2 and Zy ® Zp. 


2. Ifp<q and p does not divide q—1, then G is cyclic, t.e. G = Zpq. 


3. If p <q and p divides q —1, then either G = Zpyq is cyclic or G is a unique 
(up to isomorphism) non-abelian group generated by x and y and satifying the 
relations x? =1, y1=1 and yx = xy* where k £1 (mod q) and k? = 1 (mod q) 
with k EN. 


Proof 4.21 [fp =dq, then we have already proved in Corollary 4.8 that G is abelian 
and the fact that there are only two such groups, namely Z,2 and Zp ® Zp, follows 
from the classification of finite abelian groups. Assume from now on that p < q. By 
the First Sylow Theorem, there exists a p-Sylow, say P, and a q-Sylow, say Q. We 
point out that PX Q =1, for the order of the subgroup PQ of both P and Q would 
have to divide both p and q and so necessarily must have order 1. Furthermore, since 
P and Q are each of prime order they must be cyclic. Set P = (x) and Q = (y) for 
some x,y € G. Since p is the smallest prime dividing |G| and |G : Q| = p, by Ore’s 
Theorem, Q<G. By the Third Sylow Theorem, np divides q and is congruent to 1 
modulo p. This implies that np = 1 or q. These two cases correspond to statements 
2 and 3 of this theorem. For notice if ny = q, then q = 1 (mod p) and so p divides 
q—1. 

Therefore, to prove statement 2, we must assume that np» = 1 and so PG, by 
Corollary 4.5. Notice that a~!y~!xy € PNQ = 1 and so x~'y~!xy = 1 or equivalently 
ry = yx. This in turn implies that o(ay) = pq. Indeed, 


(y)P* = (xP )"(y")” = 18” =1, 


and xy #1 otherwise P = Q and finally we show that o(xy) 4 p or q (similarly). If 
it were that o(xy) = p, then 


1 = (xy)? = xPyP = y?, 


and so q divides p which implies q = p, a contradiction. Since o(xy) = pq = |GI, this 
implies that G = (xy) is cyclic, thus proving statement 2. 
To prove statement 8, first note that since PQ =1 this implies that 


IPQ] = |PIQI/|P.9 Q| = pq = |GI, 


and soG = PQ and G is generated by x and y. We observe further that Q<G implies 
x tyr € Q and so x~!yx = y* for some natural number k < q and so yx = xy". From 
this last equation, one can show by induction, we get that yx' = aiyk fori =1,2,.... 
In particular, yx? = xy which simplifies to y = y* or yk’-! = 1. But then q 
divides k? — 1 and so k? = 1 (mod q). We may assume that k 4 1 (mod q), for 
otherwise yx = xy and G is the cyclic group already discussed above. Since p is a 
prime and referring to Lemma 4.8 it must be that k = 1 and so the p distinct solutions 
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to x? = 1 (mod q) must be 1,k,k?,...,k?-+. For each s € {1,k,k?,...,k? +}, one 
can show that x and y® generate G with (y°)? = 1 and y’x = x(y*)*. Furthermore, 


one can check that the homomorphism which takes x to x and y to y® is, in fact, an 
isomorphism. Therefore, these seemingly p different groups are, in fact, the same up 
to isomorphism. 


Example 4.13 Here, we present instances of classifying groups of a given order pq 
using the theorem just proved. 


1. 


Groups of order 4 = 2-2 must be abelian and there are only two such groups, 
namely Z4 and Zz @ Zo. 


. For groups of order 6 = 2-3 we have 2 divides 3 — 1 and so there is a cyclic 


group of order 6, namely Ze and exactly one non-abelian group of order 6 which 
must be the familiar S3. 


. Groups of order 9 as we saw for order 4 are Zg and Z3 @ Z3. 


. For groups of order 10 = 2-5 we have 2 divides 5—1 and so there is the cyclic 


group Ziq and the non-abelian dihedral group Ds. In fact, this situation can 
be generalized as follows: If |G| = 2p for an odd prime p, then either G is 
isomorphic to Zo, or G' is isomorphic to the dihedral group Dp, since 2 divides 
p—l. 


. For groups of order 15 = 3-5 we have 3 does not divide 5 — 1 and so there is 


only the cyclic group Zy5. 


. For groups of order 21 = 3-7 we have 3 divides 7 —1 and so there is the 


cyclic group Zz, and a non-abelian group generated by x and y and satisfying 
the relations x’ = 1, y? =1 and xy = ykx where k #1 (mod 7) and k® = 
1 (mod 7). Here is a concrete way to describe this non-abelian group: 


1 a 
o={] : a€ Zz, bat = 1.2.4 b 


where the operation is matrix multiplication modulo 7. 


Example 4.14 Here is another sampling of finite group classification arguments us- 
ing the Sylow Theorems. In these examples our goal is to prove that G cannot be 
simple. 


1. 


Any group of prime power n > 1 order cannot be simple (ifn =1, thenG = Z, 
which is simple). There are two ways to see this. One way to see this is to 
point out that the center of the group is a non-trivial normal subgroup; now it 
could equal the whole group, but then G would be abelian and thus non-simple. 
Another way is to use the Sylow Theorems. By the First Sylow Theorem, if 
|G| = p” for a prime p and positive integer n > 1, then G has a p-subgroup, 
say H, of order p"~'. But then [G : H] = p and p is the smallest (in fact, 
only) prime dividing |G|. Therefore, by Ore’s Theorem, HG and so G is not 
simple. 
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. If |G| = pq where p < q are primes and p does not divides q—1, then G is not 
simple. Indeed, as in the proof of Theorem 4.12, the q-Sylow Q<G makes G 
not simple. 


. More generally, any group of order p"m where p is prime, n > 0, p does not 
divide m and1<m< p is not simple (orders such as 6 = 3-2, 10= 5-2, 15 = 
5-3, 18 = 37-2, 20 =5-4). By the Sylow theorems we have that ny = 1 (mod p) 
and divides m. This means np» = 1 or p+1, etc. But since m < p it is necessarily 
the case that ny» = 1 (there is not enough room in G for even two p-Sylows). 
This yields a unique normal p-Sylow in G, thus making G not simple. 


. Any group of order 45 = 37-5 is not simple, since by the Third Sylow Theorem, 
n3 = 1 (mod 3) and must divide 5 and son3 = 1. Therefore, by Corollary 4.5. tii, 
the unique 3-Sylow is normal in G, thus making G not simple. 


. The case that any group of order 56 = 2°-7 is not simple has a different sort 
of counting argument. By the Sylow Theorems, nz = 1 (mod 7) and divides 8. 
In the case that n7 = 1, then G will have a unique 7-Sylow which by Corol- 
lary 4.5.1 must be normal in G, hence making G not simple. In the case that 
n7 = 8, let Pi,..., Pg represent the eight 7-Sylows of order 7. Note that for any 
1<i<g <8 the subgroup P; 1 P; must be trivial being a proper subgroup of 
P, which has order 7. Furthermore, the siz non-trivial elements of each 7-Sylow 
have order 7 giving us |P, U--+U P| = (8)(6) = 48 elements of G of order 7. 
Since G has at least one 2-Sylow of order 23 = 8, this means G has an addi- 
tional 7 non-trivial elements of order a power of 2. But now we have counted 
all the elements of G. This means there is a single 2-Sylow in G which again 
must be normal in G, thus making G not simple. 


. Any group of order 30 = 2-3-5 is not simple. Using the Sylow Theorems one 
gets that n5 = 1 or 6 and nz = 1 or 10. As in the previous example the 5- 
Sylows intersect each other trivially and have four non-trivial elements of order 
5. If you had siz 5-Sylows, then you would have 24 elements of order 5. The 3- 
Sylows intersect each other trivially and have two non-trivial elements of order 
3. If you had ten 3-Sylows, then you would have 20 elements of order 3. Since 
|G| = 30, you cannot have both ns = 6 and n3 = 10 for otherwise you would be 
counting 45 distinct elements in G, an obvious contradiction. Therefore, either 
ns = 1 or ng = 1 which in turn produces either a 5-Sylow or 3-Sylow normal 
in G, thus making G not simple. 


. No group of order 96 is simple. Let G be a group of order 96 = 3 - 2°. By the 
Third Sylow Theorem, the number of 2-Sylows is congruent to 1 modulo 2 and 
divides 3 which implies there are either one or three 2-Sylows. If there is one 
2-Sylow, then it is normal in G and so G is not simple. Should there be three 
2-Sylows we show once again that G is not simple. Let P and Q be two of the 
2-Sylows. Consider the product PQ. We know that 


|P\|Q| 32-32 


96 2 Pl = Teng] = [Pana 
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Therefore, |PNQ| > %. Since P and Q are distinct and | PNQ| divides |P| = 32 
it must be that |PNQ| = 16. Set N equal to the normalizer of PO Q in G. 
Since the index of PO Q in both P and Q is two, PQ is normal in both P 
and Q and so P and Q are contained in N. Therefore, |N| > 32 (since both P 
and @ are distinct subgroups of N each of order 32). Since |N\| divides |G| we 
are forced to conclude that |N| = |G|. But this makes PQ normal in G and 
s0 G is not simple. 


. This final argument does not use the Third Sylow Theorem. Indeed, should 


one try to apply the Third Sylow Theorem, it would not lead to the desired 
conclusion. We show any group of order 224 = 2°-7 is not simple. Let P be a 
2-Sylow of a group G of order 224. Let G act on G/P by left multiplication. This 
induces a homomorphism 1: G + Sym(G/P). Since, |G/P| = 224/32 = 7 we 
have that Sym(G/P) = S7. We have seen already that for this particular action 
the normal subgroup ker a < P and so ker x is a proper subgroup of G. We show 
now that ker x is non-trivial which makes G not simple. If it were the case that 
ker = 1, then G would embed in Sym(G/P) and so |G| = 224 would divide 
|Sym(G/P)| = |S7| = 7!, but this is clearly not the case. We leave it as an 
exercise to show that if |G| = p"m with p{m and pt (m—1)!, then G is not 
simple. 
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Prove that any group of order 100 has a normal subgroup of order 25. 
Describe the 2-Sylows of Dio. 

Describe all the Sylow subgroups in S4. 

Classify the groups of order 33. 

Show there is no simple group of order 148. 

Show there is no simple group of order 48. 

Show there is no simple group of order 36. 

Show there is no simple group of order 225. 

Prove that the only simple groups of order < 60 are of prime order. 


In the proof of Theorem 4.12, verify that 


a. x and y® generate G with (y°)4 = 1 and y°x = 2(y’)*. 


b. the homomorphism which takes x to x and y to y’ is, in fact, an isomorphism. 
Prove that if |G| = p"m with p{m and p{ (m-— 1)!, then G is not simple. 


Check that, equipped with the techniques and results of this section, one can 
show order-by-order that there are no non-abelian groups of order < 60. 
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4.7 FINITE ABELIAN GROUPS 


In the section give the complete proof of a fact we illuded to earlier — we classify all 
finite abelian groups. We could easily extend this result to a larger class of groups, 
namely finitely generated abelian groups, but it would add too much to the compli- 
cation of the proof. 


Lemma 4.4 Let G an finite abelian group. 


1. For each prime p dividing the order of G there is a unique p-Sylow subgroup of 
G. 


2. If |G| = p"m and p{m, then the unique p-Sylow subgroup of G is 
P={geG: g? =I}. 
8. G is a direct sum of all its Sylow subgroups. 


Proof 4.22 Since G is abelian every subgroup is normal and so for each p-Sylow 
P<aG. Let Q be another p-Sylow. By the Second Sylow Theorem there exists g € G 
such that Q = g~'Pg, but g-'Pg = P and soQ =P. 

Set H={geEG : g? =1} which is a subgroup of G, since G is abelian. Now H 
is a p-subgroup of G, for suppose some prime q divides |H|. By Cauchy’s Theorem, 
there exists an element h € H of order q. Since hP" = 1, this implies q divides p” 
and so q = p. Hence, the order of H is a power of p. Let P be the unique p-Sylow 
of G. By the Second Sylow Theorem, there exists g € G such that g-'Hg C P and 
so H C gPg"! =P. For the reverse inclusion, since |P| = p” and any element of a 
group raised to the order of the group equals the identity, by the very definition of H 
we have PC H and so P=H. 

We prove the third statement by induction on the order of G. If |G| = 1 the result 
is trivial. If |G| > 1, suppose p divides |G| and write |G| = p"m with p { m. Set 
P={geEG : g? =1}, the unique p-Sylow inG andH={geEG : gW@=1} <G. 


Claim 4.1 G=PO6H 

First, if g € PCH, then o(g) divides both p” and m which are relatively prime 
and so o(g) = 1 which implies g = 1. Second, take any g € G. Since |G| = p"m 
with p {_m it must be that o(g) = p'k where 0 <1 <n with p{ k and k\m. Since 
gcd(p',k) = 1 there exist integers r and s such that p'r +ks = 1. Notice that g = 
ght tke — (gk\*(g?'\" © PH, since (gh)?” = (gh *\P"" = 17°" = 1 and (g?)™ = 


By induction H, which has order |G|/|P| < |G, is a direct sum of all its Sylow 
subgroups which in turn makes the result true for G. 


The next result is a technical lemma needed for the classification of finite abelian 
groups, and otherwise has little value. 
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Lemma 4.5 Let G be a finite abelian p-group and g € G be of maximal order in 
G. Set H = (g). For any aH € G/H there exists b € G such that (b) 0 H = 1 and 
bH =aH. 


Proof 4.23 Set s = 0(aH) in G/H so that a°H = (aH)* = H. Therefore, a° = g* 
for some positive integer k. Write k = pm with p” < o(g) and p{m. Note that 


my _ __ (9) (9) _ 
0) = Gedlogamy 19? 


Since (aH) = a) H = H this implies that s divides o(a). Therefore, 


_ ofa) _ ofa) 
gcd(o(a), s) s 


o(a*) 
On the other hand, 


8) 6 k =o(g?'™) =o my)pry __ o(g™) = o(g) o(g) 
1 Ol) = = MI) = edlo(g"),p) ~ ged(olg),p") ~ p” 


a) 


Therefore, o(a)p” = o(g)s. By assumption, o(a) < o(g) and both are powers of 
the prime p (since G is a p-group), thus s is also a power of p — in fact, s divides p”. 
Hence, we can write p” = st for some integer t. Set h = g™ € H. Note that 


hs mst m k s 


g as g =a. 


Set b = ah~'. We show that this is the b we are looking for as stated in the 
lemma. First, aH = bH, sinceb = ah~' € aH. Second, suppose that c € (b)NH. Then 
c= b§ =ath~ € H for some integer e. Therefore, a € H and so (aH)* = a° H = H. 
But then s divides e and we can write e = sf for some integer f. Hence, 


c= bf =b% = (b°)f = (ah) = (aa *)f = 1 = 1. 
Thus, the lemma is proved. 


Lemma 4.6 [fG is a finite abelian p-group, then G = (g)@K where (g) is a maximal 
cyclic group in G, t.e. there is noa€ G with (g) & (a) CG. 


Proof 4.24 The proof is by induction on the order of G. If |G| = 1 the result holds 
trivially. If |G| > 1, then |G| = p” with n > 1. Should G be cyclic, then G itself is 
maximal cyclic and K = 1. We can therefore assume G is not cyclic with |G| = p” and 
n> 1. Since G is finite, by well ordering, we can always find a g € G with (g) cyclic 
and of maximal size. Set H = (g). By Cauchy’s Lemma we know |H| > p and so 
|G/H| < |G|. Since G is finite, by a finite number of applications of induction we can 
write G/H = H,®---@H,, where each H; is maximal cyclic in G/H. By Lemma 4.6, 
we can choose a1,...,dn € G such that each H; = (a;H) and (a;) H = 1. Set 
K = (aj,...,@n). The following claim will complete the proof: 
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Claim 4.2 G=H@O@K 

First, if € HO K, then « = aj"---am™ © H. Therefore, H = cH = 
(a,H)™ --+(GnH)™. Since we have unique representation in H, ®--- ® Hy and 
H = (a,H)°-++(a,H)°, it must be that my =--- = mp, =0 and sox =a)---a9 =1. 
Second, take an x € G. Then eH € G/H and so cH = (a,H)™---(a,H)™ = 
(aj"---a")H. Thus, x € (ay --- af") = H(ay"--- at) € AK. 


Corollary 4.6 If G is a finite abelian p-group, then G is a direct sum of cyclic 
subgroups of prime power order. 


Proof 4.25 This follows immediately by a finite number of applications of 
Lemma 4.6. 


We are now ready to classify finite abelian groups. We will, in fact, give two 
classifications in the sense that we will give two different ways of listing the non- 
isomorphic abelian groups of a given finite order. 


Theorem 4.13 (Classification of Finite Abelian Groups I) Jf G is a non- 
trivial finite abelian group, then G is isomorphic to a direct sum of non-trivial cyclic 
subgroups of prime power order. Furthermore, this direct sum representation is unique 
up to isomorphism and order. 


Proof 4.26 By Lemma 4.4, G is a direct sum of its Sylow subgroups and then by 
Corollary 4.6 each of these Sylow subgroups can be written as a direct sum of cyclic 
groups of prime power order. This proves the existence part of the theorem. 

We now show the uniqueness of this representation. Without loss of generality, 
we can assume G is a p-group, since by Lemma 4.4, G is a direct sum of its Sylow 
subgroups. We prove existence in this simpler case by induction on the order of G. 
If |G| = 2, then G is cyclic and G itself is the unique representation of itself as a 
product of cyclic groups of prime power order. Now assume |G| > 2 and suppose 
G=H,@:---@®H, andG = Kk, @-:-OKm where the H; and K; are cyclic subgroups 
of G of prime power order. Suppose for each i we have H; = Zpx and K; = Z,», and 
by reordering we may assume a, > ag > ++: > an and by > bg > +--+ > bm. Consider 
the map 6: G— G by o(g) = g” which is a homomorphism, since G is abelian. Let’s 
focus on the image of this map. 


1. The kernel of this map consists of elements of order p and is non-trivial by 


Cauchy’s Lemma. By FTH, G/ ker ¢ = ¢(G) and so |¢(G)| = |G|/| ker ¢| < |G|. 
2. o(G) = H? ®---® HP where each Hj = Z,o,-1. We should point out that the 


sum remains direct, since 
AP O(AP +--+ AP, +AP +--+ HP) 
CA;N(A,4+--- Ay-1 + Hig. +--+ + An) a 
For each H; = Z, note that H? = 1. If k equals the number of H; isomorphic 
to Z,, then o(G) = H? ®---@ H?_,. Of course a similar thing happens to the 
representation of G= Kk, @---@ Ky yielding o(G) = KP @---@ K? _, where 
l equals the number of Kk; isomorphic to Zp. 
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Hence, we may invoke induction on ¢(G) to get aj—1 = b;-1 fori = 1,2,...,n—k 
andn—k=m-—l. Thus, it remains to show that k =1. To see this, note that 


| Hy ||H2|---|Hn| = |Ai 6 He @--- 6 H,,| = |G| 


= |Ki © Ky ++: ® Km| = |Ki||Kol-- + |Kml- 


By cancelling terms we now know are equal we get |Hn—r+i\---|Hnl = 
|Km—i+1]++:|Km| and since the remaining subgroups are all isomorphic to Zp, we 
have p* = p! and sok =1. 


Example 4.15 Let G be an abelian group of order 200 = 2? - 5?. Therefore, by 
Lemma 4.4, G has a unique 2-Sylow P of order 8 and a unique 5-Sylow Q of order 
25 with G = P@Q. Now by Theorem 4.18, P is isomorphic to either Zg, Z4 ® Zo, 
or Ze ®@ Z2 @ Zo. Similarly, Q is isomorphic to either Zo5 or Z5 © Zs. Therefore, the 
complete list of non-isomorphic abelian groups of order 200 is 


Zg@Zo5, LZaPLZ,0L25, LZaPZ,.022,, “Zea OZ, 024; 


Lo 8 Ly PB LZy BP Lo5, LZoB 2202, 025 6 Zs. 


Theorem 4.14 (Classification of Finite Abelian GroupsII) [f G is a non- 
trivial finite abelian group, then G is isomorphic to a direct sum of non-trivial cyclic 
subgroups, G= H, ®@ Hp ®---@ Ay, with |H;| dividing |Hj_-1| for i = 2,3,...,n. 


Proof 4.27 The proof is by induction on the number of distinct prime divisors of 
G. For the base case, G is then a p-group and, by Corollary 4.6, can be written as 
a direct sum of cyclic groups of prime power order. Now simply rearrange the cyclic 
subgroups into decreasing order. The subgroups successively divide, since they all have 
order a power of the same prime. 

Now assume more than one prime divides the order of the group. Suppose p is a 
prime dividing |G| and P is the p-Sylow subgroup of G. As in the proof of Lemma 4.4, 
we can write G = P@®H. By induction, we can represent H = H, @---®@ Hy, where 
each H; is cyclic and |H;| divides |Hj_,| for i = 2,3,...,n. By Corollary 4.6, we 
may write P= Kk, ®---@ Km where each K; is a cyclic group of order a power of 
p. Suppose for each i we have H; = Zn, and Kj = Zp and by reordering the kK; we 
have ay > a2 >-+: > Gm. Note that for each i < min(m,n), since gcd(nj, p“) = 1, 
Exercise 6 in Section 2.5, we know H,+K; = H;®K;, remains cyclic (of order nyp™ ) 
and |H; ® K;| still divides |Hj4, ® Ki41|. Therefore, pair off in successive order as 
many H; and K; as possible to get the desired representation. 


Example 4.16 The proof just presented gives the algorithm for producing the desired 
representation. Let’s return to abelian groups G of order 200. Recall that G has a 2- 
Sylow P of order 8 and a 5-Sylow Q of order 25. As before P is isomorphic to either 
Zg, L4@Zo, or Zo 8 Zy@8 Ze and Q is isomorphic to either Zo, or Zs ® Zs. Then 
the complete list of non-isomorphic abelian groups of order 200 is 


Zs ® Zo5 = Zao0 
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Zg ® (Zs © Zs) = (Zg © Zs) © Zs = Zap © Zs 
(Za ® Zz) ® Zo5 = (Zq B Za5) B Zz = Zr00 B Ze 
(Za ® Za) © (Z5 ® Zs) = (Za 
(Zz @ Ze © Ze) OZ 


425 
(Zo Le Zz) (Zs Zs) 


eS 
R 
ou 
SS 


(Zo ® Zs) = Zao © Z10 


DW. ad DW. a 


r r Two r 
= (Zp 4 195) 4 420 42 = 450 WY 42 0 42 


(Zo @ Zs) ® (Ze © Zs) © Zo ~ Zio © Zi10 © Ze 


Corollary 4.7 If G is a finite abelian group and d > 0 divides the order of G, then 
G has a subgroup of order d. 


EXERCISES 


1 Classify the abelian groups of the given order first using Theorem 4.13 and then 
using Theorem 4.14 


a. Abelian groups of order 35. 
b. Abelian groups of order 20. 
c. Abelian groups of order 36. 
d. Abelian groups of order 72. 
e. Abelian groups of order 216. 
f. Abelian groups of order 30. 


2 Prove that if G is an abelian group of order square-free (i.e. a product of distinct 
primes), then G is cyclic. 


3 Prove Corollary 4.7. 


CHAPTER 5 


Group Presentation and 
Representations 


N THIS CHAPTER, we introduce two topics in group theory which can each fill an 
| Bemes textbook. Therefore, one can view this chapter an giving exposure to this 
two topics with a light overview. In Section 5.1, we introduce the notion of a free 
group, which is a group generated by a set of elements, as a prelude to Section 5.2, 
group presentations, in which the group generators have additional rules for how 
they relate to each other. The free groups are free of these additional rules. Finally, in 
Section 5.3, we introduce group representations, in which we represent group elements 
by matrices. 


5.1. FREE GROUPS 


One can think of free groups as groups defined with the minimal amount of structure, 
i.e. the group axioms themselves. We start with the definition of free groups. 


Definition 5.1 A group F is free if there is a subset X of F' satisfying the following 
property: For any group G and function f : X — G there is a unique homomorphism 
extending f to F, i.e. there is a homomorphism ¢: F — G such that o(x) = f(x) 
for alla eX. 

In this case, X ts called a basis for F and we say that F is free on X. 


To get a feel for this definition, consider the setting of a vector space V. Recall 
the result that if X is a collection of basis vectors for V and we assign each 7 € X 
to a vector in another vector space W, say f(x), then there is a unique vector space 
homomorphism (typically called a linear transformation) which sends each « € X to 
the same place that f did. Therefore, in this setting every vector space would be free. 
Such is not the case for groups as we shall see. 


Example 5.1 The additive group Z is a free group on the singleton set X = {1} 


or X = {-1}. Indeed, if we have a function f : {1} > G, then we can define a 
homomorphism @: Z— G by o(n) = f(1)”. 
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The following result says there is a unique (up to isomorphism) free group on a 
set of a fixed cardinality. 


Theorem 5.1 I/f F is a free group on X and F" is a free group on X' and |X| = |X", 
then F =F". 


Proof 5.1 Since |X| =|X’|, there is a bijection g: X — X'. Consider the inclusion 
maps f : X > F and f': X' > F’. Seth = (f')" ogo f a map from X to F' 
and h' = f-'og-'o f' a map from X' to F. Since F and F" are free on X and 
X"' (respectively), there are homomorphisms ¢ and ¢' extending h to F and h’ to F’ 
(respectively). Notice that ¢' 0 6 = 1p, since the restriction 


(sod) [X=fogtoflo(f)'ogof = 1x, 


and since 1p is another homomorphism extending 1x and F is free on X, by unique- 
ness, d' od= 1p. Similarly, 60 ¢! = 1 and so ¢ is an isomorphism from F to F" 
and thus F & F". 


Theorem 5.2 The following are true: 
1. If F is free on X, then X generates F, i.e. F = (X). 


2. F is a free group on X iff F = (X) and for any group G and function 
f:X 3G there is a homomorphism extending f to F. 


3. For every non-empty set X there is a group F such that F is free on X. 


Proof 5.2 For the first statement, consider the inclusion map f : X — (X). Since 
F is free on X we can extend this map to a homomorphism @ : F — (X). Since 
(X) < F we can define an inclusion monomorphism w : (X) + F and it follows that 
(woo) [| X =1x and so Wo @ extends the map 1x : X > F. Since 1p also extends 
lx, by uniqueness it follows that wo d= 1p. Therefore, the inclusion map w is also 
surjective and thus (X) = F. 

To prove the second statement, we already have one direction by what we just 
proved. Therefore, assume that F = (X) and for any group G and function f :X 4G 
there is a homomorphism 6: F + G eatending f to F. It suffices to show that this 
homomorphism is uniquely defined. To see this, notice that an element of F has the 
form x51 ---x& where the x;’s are in X (perhaps repeats) and n,€1,...,€n are positive 
integers. Since @ is a homomorphism we know that 


PCH + Uy) = PH) + Plan) = Fle ++ fn). 


Hence, the homomorphism @ is completely determined by f and therefore must be 
a unique extension. 

To prove the third statement, for our set X we define the corresponding set of new 
symbols X' = {a' | « € X} clearly of the same cardinality as X (X' will represent 
the corresponding inverses of the elements of X). Set Y = XUX'U{e}, where e is a 
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symbol not appearing in X or X' (and will denote the identity element). Now define 
the following sets: 
S={yiy2-+- | each yi € Y}, 


W ={yyo-: | each y;€Y and ine Z*, y,=e fori>n}. 


Let R be the elements in W for which no x and its corresponding x' ever appear 
adjacent to each other, and e never precedes an element of X UX’. The set S repre- 
sents infinite strings or words in Y, W represents finite words in Y and R is called 
reduced words in Y. A typical element in R can be represented as yj! -+-y;*, where 
adjacent y; and yi41 are both distinct in X UX" and not of the form x and x’ (for 
some x € X) and represents an abbreviation for 


Y1°°° U1 UR YR ECE: fc 
SS —SS_ 


e, times e, times 


Set 1 = ee---. One can easily check that F = R is a group via the operation 
concatenation and simplification to a reduced word via the following identities: for 
any x € X we have xa’ = e and xe = x (and so x’ is indeed the inverse of x and 
e is the identity element in F'). It remains to show that F is free on X. To do this 
we employ the result just proved in part 2. First, F is certainly generated by (X) 
by its very definition. Second, for any map f : X — G define the homomorphism 
o:F 4G by 

PY + Ue) = PCY) ++ OUR), 


where 6(x') = ¢(x)~! for any x € X. 
Remark 5.1 Several remarks are in order. 


1. Having proved Theorem 5.2.3, we can see where the terminology free comes from 
in the sense that the set X freely generates F without putting any conditions 
on X, such as xy = yx, for instance. 


2. Without loss of generality, by Theorem 5.1, we can always assume if necessary 
that our free group is the one we just constructed in Theorem 5.2.8. 


3. Notice that the construction of the free group F' in the proof of Theorem 5.2.8 
dictates that the identity element of F is not in X. 


Definition 5.2 [f F is a free group on a set X, then the rank of F is equal to the 
cardinality of X. 


Remark 5.2 We make some further remarks. 


1. If F is a free group of rank at least 2, then F is not abelian. Suppose, to the 
contrary, it were abelian and take x,y € X. Since F is abelian we have xy = yx 
or x ty 'wy =1, ie. a ty~txy reduces to 1. However, x~ty~txy is clearly a 
reduced word, since x # y~' — a contradiction (using a similar argument one 


can show the stronger result that such a group has a trivial center). Note this 
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also proves that the free groups which are abelian are exactly the free groups 
generated by a single element which are infinite cyclic and therefore isomorphic 
to Z. 


2. No element of a free group can have finite order except of course 1. Suppose, to 
the contrary, that some reduced word x1 -+-%n € (X) = F had finite order. Then 
for some positive integer k we would have (x, +++ xn)" = 1, i.e. in particular the 


word 
(@1-°++Dn)-++(L1-++Ln) reduces to 1. 
—_—_—— 
k times 
But the only way that could occur is if tp = ee, an =X, ... etc. Ifn is even, 


then aig = L(n/2)41 and 1 -+++Lp reduces to 1 contrary to our assumption. If 
n is odd, then X(n41)/2 ts tts own inverse which is impossible according to how 
F was constructed in Theorem 5.2.8. Note that this result in turn shows the 
generators of a free group necessarily have infinite order, and thus every free 
group ts infinite. 


3. Not every group is free. As we have seen in a remark above, any abelian non- 
cyclic group cannot be free (in fact, any group with a non-trivial center cannot 


be free). 


4. A subgroup of a free group does not necessarily have the same rank as the 
entire group. In fact, as we shall see in the following example, the group may 
have finite rank while the subgroup has infinite rank. Consider the free group 
of rank 2 generated by X = {x,y} and define the subgroup H = (Y) where 
Y = {x'yx* | i = 1,2,...}. One can show that H is free on Y where Y is 
clearly an infinite set. 


5. A free group can only have a single rank, for suppose that F' were free on both 
X and Y with |X| > |Y|, then choose x € X —Y. Since F is generated by Y, 


we can express x = yj'--- yo (reduced) for some y1,..-,Yn € Y and €j,...,€n 
integers. But then x~tyj! +--+ yS" =1 or reduces to 1, which is not possible. 
EXERCISES 


1 Verify that @ defined in Example 5.1 is indeed a homomorphism. 


2 Verify in the proof of Theorem 5.2.3 that F’ = FR is a group via the operation 
concatenation and simplification to a reduced word via the following identities: 
for any x € X we have rz’ =e and re= 2. 


3 Referring to Remark 5.2.4, verify that H is free on Y. 


4 Let F be a free group on X = {21,29}. Set y. = 27, yo = x2, y3 = T12q and 
Y = {y1, yo, y3}. Prove that G = (Y) is free on Y. 
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5.2 GROUP PRESENTATIONS 


In this section, we take a free group a step further by putting restrictions on the 
generators thus making the them no longer independent or free. These restrictions 
will be called relations and the generators together these relations between the 
generators will constitute the presentation of a group. In a sense, we exhibit the 
essential or pure description of the group. We now give the formal definition. 


Definition 5.3 Let F' be free group on a set X and Ra subset of F. Then (X | R) 
is called a group presentation. 


Example 5.2 Here are some group presentations which we shall look at in detail 
during this section: 


1. Let X = {x} and R = {x"}, for some positive integer n. Then certainly the 
free group on X is isomorphic to Z (see proof of Theorem 5.2.8). We shall show 
that the group presentation (x | x") represents a cyclic group of order n. 


2. Let X = {x,y} and R = {x?,y*,a7-ly xy}. We shall show that the group 


presentation (x,y | x7, y?,a2~'y~lxy) represents the Klein-4 group. 


3. Let X = {x,y} and R = {x-'y~txy}. This group presentation represents a 
free abelian group. 


4. Let X = {x,y} and R = {x*,x?y~?, ryxy'}. We shall show that this is the 
generators and relations for the quaternions. 


5. Let X = {81,635 1..,0 4} and 
R= {el (jez), (pai)? | 4,6) = ln; fH 1,2. 0 29 < kp 
This group presentation represents, Sy, the symmetric group on n elements. 


6. Let X = {x,y} and R= {x?,y",ryry}. This group presentation represents the 
dihedral group Dn. 


7. The Generalized Quaternions have group presentation (x,y | yxyx ',cyxy'). 


Now we give the formal definition of what it means for a group to be represented 
by a group presentation. First note that for any subset R of a group G there is a 
notion of a smallest normal subgroup Np of G containing R. Indeed, this result holds 
and is left as an exercise for the reader. 


Lemma 5.1 Let R be a subset of a group G. 


1. There exists a smallest normal subgroup Nr of G containing R in the sense 
that if NAG and N 2 R, then Nr CN. 


2. The Np in the first statement is the intersection of all normal subgroups of G 
containing R. 


Group Presentation and Representations M 141 


3. The Nr in the first statement equals 


{rp% --- r= | 7, ER, gg EG, nEZ*Y. 


Hence, Nr is sometimes called the normal subgroup of G generated by R. 


Definition 5.4 Let F be a free group on a set X and R a subset of F and let Nr 
be the normal subgroup of F' generated by R. A group G has presentation (X | R) 
— or G is defined by generators X and relations R — or (X | R) is a group 
presentation of G - if G = F/Nr. 


Notice that in the group F'/Np the relations are being equated with the identity, 
so that a group having such a presentation has additional constraints, namely that 
each element in the relations is now equal to 1. For this reason we define the phrase 
that a group G satisfies the relations FR if there is an epimorphism ¢: F' > G 
such that Nr < ker(@). We now list some useful results that will help us link groups 
to their presentations. 


Lemma 5.2 Consider a group presentation with generators X and relations R. Let 
F be the free group associated with X. 


1. If G has presentation (X | R), then G satisfies the relations R. 
2. The group F/Nr = (tNr : ce X) 


&. If G is a group with f : X > G a map satisfying the property that for all 
r=ay--- xe © R we have f(x)" --- f(an)™ =1, then there exists a unique 
homomorphism ® : F/Nr — G such that ®(aNr) = f(x) for alla Ee X. 


4. If a finite group G satisfies the relations R, then there is an epimorphism w : 


5. If a finite group G satisfies the relations R and |G| < |F/Npl, then G has 
presentation (X | R). 


Proof 5.3 For the first statement, we are given that there is an isomorphism from 
w: F/NR > G. Consider the quotient homomorphism v : F + F/Nr. Then the 
composition ov: F — G is an epimorphism whose kernel is precisely the normal 
subgroup Nr (exercise) and thus contains Nr. 

The second statement is immediate, since F = (X). For the third statement, 
since F' is free on X we can extend f to a unique homomorphism ¢: F > G. By 
our assumptions on f we see that o(r) = 1 for all r € R and so Nr < ker(@). 
Since Nr < ker(¢), we have a well-defined homomorphism x : F/Nr > F'/ker(@) by 
x(aNpr) = xcker(¢) (see Exercise 3 of Section 2.10). By the Fundamental Theorem of 
Homomorphisms, there is a monomorphism w : F/ker(¢) > G by v(gNr) = ¢(g). 
Then ® = Woy is the desired homomorphism uniquely determined since was 
uniquely determined. 
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For the fourth statement, since G satisfies the relations R, there is an epimorphism 
@: F + G such that Nr < ker(¢). Therefore, using the Third Isomorphism Theorem, 
we have 


F'/ker(o) = (F/Nr)/(ker(¢)/Nr). 


Call this isomorphism x mapping (F'/Nr)/(ker(¢)/Nr) to F/ker(¢). By FTH, 
there is an isomophism ® : F'/ker(¢) + G. Consider the quotient epimorphism 
v: F/Nr > (F/Nr)/(ker(¢)/Nr) by v(aNr) = (aNr)(ker(¢)/Nr). Then the com- 
position Poy ov is the desired epimorphism. 


F/Np > (F/Nr)/(ker(¢)/Nr) “+ F/ker($) => G. 


For the fifth statement, since |G| < |F'/Npr|, the epimorphism in the fourth state- 
ment is now an isomorphism thus proving the result. 


Example 5.3 Lemma 5.2 provides us with the tools necessary to associate groups 
with their presentations. 


1. Let X = {a,y} and R = {x?,y?,x-ty"!zy}. For brevity, set N = Np and 
G = F/Nr. First note that since x~'y-!xy € N, this implies ryN = yrN. 
Furthermore, the fact that G = (aN,yN) implies G has at most 4 elements. 
Indeed, (xN)? = 2?>N =N, (yN)*2 = y?N = N and (aN)(yN) = (xy)N = 
(yx)N = (yN)(xN) so that the only elements in G that can be distinct are N, 
«tN, yN and (xy)N. Let H = Zy x Zz be the Klein-4 group and define the map 
f :X > A by f(x) = (1,0) and f(y) = (0,1). Since F is free on X there is 
a homomorphism ¢: F + H extending f. Since f maps onto the generators 
of H, © is, in fact, an epimorphism. Furthermore, it is easy to see by how 
f is defined, that R C ker(@) and so the smallest normal subgroup containing 
R, Nr < ker(@). Hence, H satisfies the relations R and |H| = 4 > |G| 
which implies by Lemma 5.2.5 that H = G and so the group presentation 
(x,y | x7,y?,v-ly~txy) represents the Klein-4 group. 

2. Let X = {x} and R= {x"}, for some positive integer n. Again, set N = Nr 
and G = F/Np. As in the previous example, we note that G = (xN) and 
(cN)" = N so that G has at most n elements. Let H = Z,, and define the map 
f:X > FA by f(x) =1 which extends to a homomorphism ¢: F + H. Again, 
@ is an epimorphism with R C ker(@) which proves H satisfies the relations R 
and since |H| =n > |G| we have H =G. 


3. Let X = {2,y} and R = {z+,a°y-?, zyzy—'}. Again, set N = Np and G = 
F/Np. Recall (see Exercise 2c. in Section 2.7) that one way to represent the 
quaternions is as follows: Let 


0 1 0 7 
A=| Hq and a=|7 a | 
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Then the quaternions, H = (A, B), the subgroup of Mz2(C) (under matrix 
multiplication) generated by A and B. One can easily check that At =I, A? = 
B? and ABAB™ = I. Define the map f : {x,y} > H by f(x) = A and 
f(y) = B which extends to a homomorphism ¢: F + H. Again, @ is an 
epimorphism with R C ker(@) which proves H satisfies the relations R. The 
difficult part of the example is to show that G has at most 8 elements (if we 
can, then G = H as it did in the examples above). First note that (xN) is 
a normal subgroup of G of order < 4. The bound on the order follows, since 
(aN)4=2*N=N. The fact that (cN) dG follows from two observations: G 
is generated by xN and yN and 


(yN) "(aN )§(yN) = (yo taty)N = (ya ayy tya")N 
= (yta®)N(axysy™)N (yo!) N = (ya) N (ya) N 
= (yla* lye 1) N =--- = (ylya*)N = a-*N = (tN)-* € (aN). 
Second, we show that G/(xN) has at most two cosets (this will complete the re- 
sult, for then |G| = [G: (aN)]|(aN)| < (2)(4) = 8). We show that G/(aN) has 
at most the two cosets (xN) and (yN)(xN). This follows from the following two 
observations: First, these cosets generate G/(xN), since xN and yN generate 


G (note that (xN) = (xN)(xN)). Secondly, observe the following products of 
the generators: 


(aN)\(xN) = (xN), since (xN)? = a°N € (aN). 


(yN)(2N)(N) = (yN){aN), since (yN)(2N)(aN) = (yN)(x*N) € (yN){aN). 


(cN)(yN)(xN) = (yN)(xN), since (2N)(yN)(xN) = (xyx)N 
= (yN) € (yN)(xN). 


(yN)(yN) = (@N), since (yN)? = y2N = (27222) N(y2N) = (@N)-? € (aN). 


Remark 5.3 Presentations for a fixed group are not unique. For instance, one can 
show that Zy is defined by the generators X = (x,y) with relations R= {x",x~1y}. 


Example 5.4 Using Sylow Theory and Group Presentation we are now in a position 
to classify groups of certain small orders. To illustrate this we will classify groups of 
order 6 and 8. 


1. Let G be a group of order 6. Using Sylow Theory one can show that G has 
a cyclic 2-Sylow of order 2 — call it P = (x) with o(a) = 2; and a normal 
cyclic 8-Sylow of order 3 — call it Q = (y) with o(y) = 3. There are two 
possibilities: either x and y commute or they don’t. If they do, then as in the 
proof of Theorem 4.12, the order of xy is 6 and G = Z¢. Let’s consider the case 
when ry # yx. One can check that the six elements of G can be represented as 
l.z,y,y?, ry, ry’ and as a result yx must equal xy”, or equivalently xyxy = 1. 
But this is the presentation which uniquely defines S3 and thus G = S3. 
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2. Let G be a group of order 8. If G is an abelian group, then the classification 
is known. Indeed, we know that G is isomorphic to either Zg, Z4 x Zor 
LZ X Ly X LZ. So we may assume that G is non-abelian. Now G has an element 
of order 4, otherwise g? = 1 for allg € G and so G would be abelian. Set 
x €G with o(x) =4 and define H = (x). Since |G: H] = 2 we know H AG. 
Let G/H = {H, yH} for some y ¢ H. Since G/H & Zy we have (yH)? = H 
and so y? € H. Now y? 4 x, for otherwise o(y) = 8 making G & Zg abelian. 
Likewise y* # x°, since o(x*) = 4. Hence, either y? = 1 or y? = x. Now 
since HAG we know yxy! € H, and since o(yxy—') = o(x) = 4, then either 
yxy += 2 or yey | = x. We show yxy! # x, for otherwise xy = yx. But, 
since G/H = {H, yH} and H = (2) this implies G = (y,x) and so G would 
be abelian. Thus, yey~! = x3. So we have two group presentations: 


Go= (ay |e =1, y ='1, zyey = 14, 
Go = ay | a? = 1, ao? =—1, eysyt =1). 


We’ve seen these group presentations earlier. Indeed, G; = D4 and G2 = Qs. 


EXERCISES 


1 Show that S3 has presentation ( z,y | 2?=1, y®=1, (xy)? =1). 
2 Show that A, has presentation ( 2, y | g=1,y=1, (xy)? =e i) 
3 Prove all parts of Lemma 5.1. 


4 Inthe proof of Lemma 5.2.1, verify that wov : F — Gis an epimorphism whose 
kernel is precisely the normal subgroup Np. 


5.3 GROUP REPRESENTATION 


In this section, we give a very naive introduction to group representation. The topic 
of group representation is an immense topic which is far too big to fit in a single 
section let alone an entire chapter. Basically we show how one can represent groups 
as a collection of matrices over the complex numbers. More specifically, we will be 
looking at linear finite dimensional representations of finite groups. 


Definition 5.5 Let G be a finite group. A linear representation of G is a group 
homomorphism ¢: G— GL,(C) where n is called the degree of the representation. 
A representation is faithful if ¢ is a group monomorphism. 


Example 5.5 Here, we list several examples of (faithful) group representations, some 
of which the reader has already encountered in the tect. 


1. For any finite group G the trivial map ¢(g) = In for all g € G is the trivial 
representation of G of degree n. This representation is faithful only for the 
trivial group. 
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2. A degree 1 representation of a group G is a group homomorphism ¢: G > C* 
and since G is of finite order this implies that ¢(g) must be a root of unity for 
all g € G, and therefore the image lies on the unit circle in the complex plane. 
In particular, if G is a cyclic group of order n, then @(g) is an nth root of unity 
for allg EG. 


For instance, if G = (g) is cyclic of order 4, then @ maps onto {+1, +i} = (i). 
Now $ maps the identity to 1 and @ is determined by where it sends g, i.e. 
o(g) = —1,i or -i, so there are three representations of G. 


3. IfG = {1,9,h,k} is the Klein-4 group, then the following is a degree 2 repre- 
sentation of G: @: G— GL2(C) by 


oa =[j ik (9) = ie af o=|} i: o(k) = es =i 


4. If G = {+1,+i,+j,+k}, the quarternions, then the following is a degree 2 
representation of G: ¢: G— GL2(C) by 


0 1 . 0 2 
Note that Qg has presentation (x,y | «4 = 1,27 = y?,ryx = y) and one can 


check that x = $(i) and y = (J) satisfy these relations. 


5. Another degree two representation of the quaternions is the following: ¢: G—> 


GL2(C) by . 
“=|, 4) and 6(j) = le Ae 
4 _ 


Again, one can check that x = $(i) and y = $(j) satisfy the relations x* = 
1,2? = y?, xyx = y. 


6. A degree 2 representation of a cyclic group is easy to understand in terms of 
a rotation group. Let G = (g) with o(g) = n. Define the linear representation 
@:G— GL2(C) by 


7. The dihedral group made up of rotations and reflections of an n-gon has a 
similar representation. The rotations are represented just as we did for the 
previous example of a cyclic group and the reflections are sent to 
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where the matrix above represents the reflection across a line that makes an 
angle of tk/n with the x-axis. 


8. A degree four representation of the quaternions is the following: ¢: G— 


GL4(R) by 
6: Sot: wos “0 Oe eet 
L, %01 08- ‘<0 Oe 303 Or a 
M)=)5 oo -1| 4 PH=11 6 oO 0 
0 O01 0 0: Sh. 0-0 


Again, one can check that x = (i) and y = @(j) satisfy the relations x*+ = 


Leh ayy. 


9. The group S;, has a representation of degree n defined as follows: Let e1, €2,...€n 
be the standard basis for C. We shall view elements of GL,,(C) as linear trans- 
formations and define the group representation @ : Sy, — GLy(C) in terms 
of this basis. In other words, foro € Sy, the image ¢(c) will be defined as 
P(a)e; = eos). One can show that @ is indeed a group homomorphism. Such 
a representation is an example of a permutation representation. Let’s take 
the specific example of S3 and let’s represent the permutation o = (1 2 3). 
Therefore, 


Hence, 


These are the so-called permutation matrices. 


10. A group G acting on a finite set X = {x1,%2,...,%n} induces a permutation 
representation, since G can be viewed as a collection of functions of the set X. 
Indeed, for each g € G, define the map fz: X — X as follows: fg(x) = gz, 
where gx is the action of g on x. In the previous example, these functions 
were bijections, but they need not necessarily be so. As in the previous example 
we view elements of GL,(C) as linear transformations and define the group 
representation 6: G — GL,(C) in terms of the standard basis. If we index 
our standard basis by the elements of X, then the group representation of G is 
@:G—+ GL,(C) and is given by d(g)ex; = €gz,- This representation will be 
faithful iff the action is faithful (a group action is faithful if gx = hx implies 
gS 


Remark 5.4 We make several remarks related to the examples given above. 


1. From our examples it is clear that group representation is not unique for a given 
group, even faithful group representations. Take for example the representation 
of a cyclic group with a degree 1, then a degree 2 representation. 
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2. Group representation is not unique even if we fix the degree. Indeed, if 6: G7 
GL,(C) is a group representation of G of degree n, then for any invertible 
matric A € GL,(C), the map py: G— GL,»(C) by w(g) = A14(g)A is also a 
group representation of G of degree n. Hence, there are, in fact, infinitely many 
different representations of a fixed degree for a given group G. 


3. Since every finite group of order n can be embedded in S, (Cayley’s Theorem) 
and S, has a faithful permutation representation, this implies that every finite 
group has a faithful linear representation. 


EXERCISES 


1 Verify for Examples 5.5.4,.5,.8 that « = $(i) and y = ¢(j) satisfy the relations 
a =1,2? = y'*,cyr = y. 


2 As was done in Example 5.5.9, compute the representation of 7 = (1 3). 


3 Using Remark 5.4.3, compute a permutation representation of the Klein-4 
group. 


CHAPTER 6 


Solvable and Nilpotent 
Groups 


N THIS CHAPTER, we introduce two families of groups: nilpotent and solvable. One 
| can think of these groups as generalizations of abelian groups. The main reason 
we cover this theory is for the application of solvable groups to answer the question 
if there exists a formula for finding the roots of a polynomial of degree five or more, 
otherwise known as solvability by radicals. Solvable groups are named as such because 
of this connection. In Section 6.1, we remind the reader of some important subgroups 
as well as introduce some additional important subgroups. In Section 6.2, we discuss 
certain chains of subgroups of a group which we use in Section 6.3 to define nilpotent 
and solvable groups. 


6.1 SOME RELEVANT SUBGROUPS 


There are several subgroups of a given group that come up a lot in our discussion of 
nilpotent and solvable groups. This section is meant to present to the reader, remind 
the reader in some cases, and discuss these subgroups as well as their implications. 


Definition 6.1 For a given group G the center of G, written 
Z(G) ={zeEG : zg=gz forall gE Gh. 


Remark 6.1 Here, we list some examples and results that the reader may also wish 
to verify. 


1. Z(G) is always an abelian normal subgroup of G. 


2. Z(G) is characteristic in G (a subgroup H is characteristic in a group G if 
for every automorphism @ of G, we have ¢(H) < H). 


3. The center of GL,(F) is the collection of all scalar matrices (i.e. matrices of 
the form al, where a € F*), where GL,(F) is the group of n x n matrices over 
a field F with matrix multiplication. 
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4. The center of SL,(F) is also a collection of all scalar matrices with the added 
restriction that a” = 1, where SL,(F) is the subgroup of GL,(F) with deter- 
minant equal to 1. 


5. The center of U3(F) , strictly upper triangular 3 x 3 matrices, are the matrices 


1 0a 
of the form | 0 1 O |, whereae F. 
00 1 


6. The center of the dihedral group D4 is the subgroup containing the identity and 
180° rotation. 


7. Consider the epimorphism t: G — Inn(G) defined by u(g) = ig, where ig is 
the inner automorphism defined by ig(x) = gzg~'. Then ker(t) = Z(G) and so 
G/Z(G) = Inn(G). 


8. For any group G and g € G, we have that g € Z(Ca(g)) where Ce(g) = {a € 
G : ga=ag}. Hence, every element of a group G lies in an abelian subgroup 


of G. 
9. If G/Z(G) is cyclic, then G is abelian. 
10. Z(Sym(X)) = 1 when |X| > 3. 
The following result is not so easy to deduce and so we shall prove this in detail: 


Theorem 6.1 Jf G is a group with trivial center, then Aut(G) has a trivial center 
as well. 


Proof 6.1 Let’s call a homomorphism which commutes with all the elements of 
Inn(G) a normal homomorphism. 

First, we show that if d is a normal automorphism, then there is a homomorphism 
w:G— Z(G) such that 6(g) =Wv(g)~1g for allg EG. 

To show this, first assume that ¢ is a normal automorphism. Define the map 
on G, wW(g) = g¢(g)~'. Notice that $(g) = w(g)~+g, as desired. Furthermore, w 
maps into Z(G). To see this, first notice that for allg € G, dig(g) = igb(g) which 
implies that ¢(g) = g¢(g)g-!, and so gd(g)~' = o(g)~'g. Second, notice that for all 
gt EG, Giga(xg*) =igrd(ag), and so o(g-'ag-*g) = g 'o(xg")g, and so 
(9) (x) = g-*o(x)(g)~"g, which implies that g6(g)* O(a) = 9(x)o(g)~'g. But 
then, by above, g¢(g)~'¢(x) = o(x)g¢(g)~+. Since @ maps onto G, we can conclude 
that for allh € G, g¢(g) th = hg¢(g)~+ which can be rewritten as )(g)h = hw(qg). 
This shows that ~ maps into Z(G). Having shown this, we can now prove that w is 
a homomorphism. Indeed, for all g,h € G, 


(gh) = gho(gh)-* = gho(h)-"6(g)~* = gb(h)o(g)* = 96g) 'b(h) = ¥(g)o(h). 


Now assume that Z(G) is trivial. We first show that the only normal automor- 
phism of G is the identity automorphism. Let ¢ be a normal automorphism of G. By 
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the work above, we know that w(g) = g¢(g)~* defines a homomorphism from G into 
Z(G) = 1, by assumption. Therefore, for allg € G, we have 1 = w(g) = gd(g)! 
and so ¢(g) = 9 for all g € G, i.e. @ is the identity automorphism. With this result 
in mind we can now show that Z(Aut(G)) is trivial. Take any ¢ € Z(Aut(G)), so in 
particular @ is a normal automorphism, and so by the work above ¢ is the identity 
automorphism. 


We leave it to the reader to show that, in fact, the map defined by ¢(g) = w(g)~'g, 
where ~ : G + Z(G) is a homomorphism, is necessarily a normal endomorphism. 
We remind the reader of another important subgroup of a group. 


Definition 6.2 Let X be a non-empty subset of a group G. The subgroup gener- 
ated by X, written (X) is the collection of all finite products of elements of X and 
their inverses. The set X is called the generating set of (X). 


If X = {91,92,---,;9n} a finite set, then we write (g1, 92,-.-,;9n) for (X). Note 
that if X = {g}, then (X) is simply the cyclic subgroup generated by g. One needs 
to check, of course, that (X) is indeed a subgroup of G. Furthermore, one can show 
that (X) is the smallest subgroup of G containing the set X. 


Example 6.1 Here, we list several examples groups and their generators. 
1. The Klein-4 group V = {e, a,b,c} is generated by X = {a,b}. 
2. The quaternions are generated by the set X = {i, 7}. 


3. The dihedral group D4 (rotations and reflections of a square) is generated by 
any single rotation and any single reflection. 


Before we go on to our next important subgroup (which will appear in the next 
section) we remind the reader of some notation. 


Definition 6.3 For g,h € G a group, the conjugate of h by g in G, written 
hI =g-thg. The conjugacy class of H < G, written HI = {h9 : he H}. 


We leave it to the reader to verify the following simple statements which attest 
to the fact that conjugation is a lot like exponentiation: 


Lemma 6.1 Let G be any group with H < G. Then 
1. For h,g,k EG, (hk)9 = h9k9 

2. Forh,g,k EG, (h*)\9 = hko 

3. Forh,g,k eG, (h9)7* = (h71)9 


4. ForgeéG, HI<G. 
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5. For hy,...,hn,g €G, (hi-++hn)o = ht. +h. 
6: Hor Mi tag SG, (geen age ts. 
We continue our discussion of several important subgroups of a given group. 


Definition 6.4 Let G be a group and g,h € G. The commutator of g and h, 
written 
[gh] =g tho 'gh = g*q". 
More generally, if 91,92,---;9n © G then the commutator of 91, 92,---,9n; 
written 
(91) 92,-+-59n] = | [91,92)--+59n—1],9n] (defined recursively). 


The following are some simple properties of commutators which the reader may 
wish to verify: 


Theorem 6.2 If g,h,k © G a group, then 
1. [g,h|-* = [h, 9] 


2. [g, h]* = [g*, h*| 

3. (gh, k] = [g, kh, Fe 
4. ee Allg, hl’ 
5. lah = flea) 
6. [g-*, h] = ([g,A]o 
7. 


Gh Re gl iho het 


The next result we shall prove in detail. First, we need some additional notation. 
For g,h,k € G define g!t* = g!g*. 


Theorem 6.3 Let g,h € G be a group and n € Z*. Then 
1. [g", h] = [g, blo” +9" C+ +941 
2. If |g, h] € Z((g, h)), then [g”, h] = [g, Al. 


Proof 6.2 We prove the first statement by induction and the use of Theorem 6.2.8. 
The base case, n = 1, is immediate. Forn > 1, 


= n—- n—2 n— ae g 
[g", A] = [9"*g, A] = [9"*, A] 19, A] = ([9, Ae tt t944)" Ug, a] = 
nee och) signe eh 


For the second statement, if |g,h] € Z((g,h)), then in particular |g, h| commutes 
with any power of g, and so 


[g, h]e ta ttt = Og, he Lg, Ie [9] = (9, Alla, h-- [9,4] = [9, AI”. 
The result then follows from the first statement. 
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Definition 6.5 Let G be a group and X and Y be two non-empty subsets of G. The 
commutator subgroup of X and Y, written 


[X,Y] =([z,y] : ce X,yeY ), 


the subgroup generated by the commutators |x, y|. More generally for X1, X2,...,Xn 
non-empty subsets of G, the commutator of X 1, X2,...,Xn, written 


[X1, X2,...,Xn] = [ [X1, Xe,.-.,Xn-1], Xn] (defined recursively). 


Remark 6.2 For the neat result, we remind the reader of some definitions and results 
we saw earlier in the text. 


1. For X and Y subsets of a group G, the set XY is the collection of all products 
of an element of X and an element of Y. 


2. If H,K <G a group, then the HK and KH are subgroups of G exactly when 
AK = KH, which is true in the special case that H and K are both normal 
subgroups of G. 


Theorem 6.4 [If H, K and L are normal subgroups of a group G, then |HK, L] = 
[H, L]|K, L}. 


Proof 6.3 First note that since H, K and L are normal in G, so are |HK, L], [H, L| 
and [K,L]. For instance, forh € H,1€ L andg € G, we have [h,l]9 = [h9,19] € 
[H,L] which shows that |H,L| 1G. By the remarks above, this in turn implies that 
(A, L|[K, L] <G. 

Hence, to show that |HK, L] C |H, L][K, L], it is enough to show that each [hk, l] € 
[HK, L] is also in |H, L||K, L]. To see this, note that 


(hk, l] = [h, U]*[k, 1] = [A¥, U*][K, U] € (H, L][K, L). 


Next, to show that |H, L|[K,L] C [HK, L], first note that since [H, L],|K,L] <1 
G, we have that |H, L|[K, L] = [K, L||H, L] and [H, L),[K,L] < G. Therefore, it’s 
enough to show a single product [h,||[h,l'] is in |HK,L] where hh € H andl,l' € L. 
Now to see this, note that for any k € K, using Theorem 6.2.8, 


[h, fh, 0] = k{h, TFA, UR? = ((hke, hk, UY) 
and this last expression is in |HK, L] being a normal subgroup of G. 


Definition 6.6 For a group G, the derived subgroup of G (or sometimes called 
the commutator subgroup of G), written G’ = [G,G]. 


Using Theorem 6.2.1, one sees that G’ is, in fact, the collection of all finite products 
of commutators in G (no need for their inverses). Another easy fact to verify is that 
G’ is characteristic in G, and hence normal in G. Here are some additional properties 
which we prove in detail: 
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Theorem 6.5 Let G and K be groups and H,L be subgroups of G. Then 
1. If6:G—>K a homomorphism, then ¢(G’) < kK’ 
2.G'<H iff HAG andG/H is abelian. 
3. G' is the smallest normal subgroup of G which will form an abelian factor group. 
4. If L/H < Z(G/H), then |G, L] < H. 


Proof 6.4 For the first statement, take any |g, hi]--+[9n, hn] € G'. By properties of 
a homomorphism, 


([91, ha} +++ [In Pn]) = O([91, al) +++ (Ins Pn]) = [6(91), O21) ++ 16(Gn), O(Pn)] € K". 


For the second statement, first assume that G’ < H. Then for all g € G and 
he H, we have 
g ‘hg = h(h-*g"*hg) = h[h, 9] € A, 


and so HG. To see that G/H is abelian, notice that for gi, 92 € G, 
HHH = gHoHg, Hoy Ho HH = gH gH |g1, g2|H = 92H gH. 
Now assume that H AG and G/H is abelian. Then for all gi, 92 € G, we have 
9, Hg Ho HoH = gy Ho Hoy! HoH = H, 


and so |g1,92|H = H. Hence, |g1, 92] € H and thus by the remark after the definition 
of G', it follows that G’ < H. 

The third statement follows immediately from the second statement, since it says 
in particular that any normal subgroup H of a group G with G/H abelian necessarily 
contains the derived group G’. 

For the fourth statement, since L/H < Z(G/H) we have that gH|H =lHgH for 
allg €Gandle€L. This can be rewritten as [g,l|H = H and so |g,l| € H. 


EXERCISES 


1 Prove the following results listed in Remark 6.1: 


a. Z(G) is always an abelian normal subgroup of G. 


b. Z(G) is characteristic in G (a subgroup H is characteristic in a group G if 
for every automorphism @ of G, we have ¢(H) < H). 


c. The center of U3(F’) , strictly upper triangular 3x3 matrices, are the matrices 


1 0a 
of the form | 0 1 O |, whereaé F. 
00 1 


d. The center of the dihedral group D4, is the subgroup containing the identity 
and 180° rotation. 
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6.2 


e. Consider the epimorphism v: G — Inn(G) defined by 1(g) = ig, where ig is 
the inner automorphism defined by i,(x) = grg~'. Then ker(v) = Z(G) and 
so G/Z(G) = Inn(G). 


f. For any group G and g € G, we have that g € Z(Coe(g)) where Ce(g) = 
{aeéG : ga=ag} (hence, every element of a group G lies in an abelian 
subgroup of G). 


g. Z(Sym(X)) =1 when |X| > 3 


Prove that if H < K < L<G, H is characteristic in K and K is characteristic 
in L, then H is characteristic in L. 


Show that in the proof of Theorem 6.1 the map defined by ¢(g) = w(g)~‘g, 
where = : G — Z(G) is a homomorphism, is necessarily a normal endomor- 
phism. 


Prove Lemma 6.1 
Prove Theorem 6.2 


Explain why in the proof of Theorem 6.4 in order to prove [H,L|[K,L] C 
[HK, L], it’s enough to show a single product [h,l|[h,l’] is in [HK, L] where 
he Handl,l' € L. 


Prove that G" is characteristic in G, for any group G. 


SERIES OF GROUPS 


We now consider chains of subgroups within a group, called series in order to even- 
tually define solvable and nilpotent groups. First, we need to distinguish between 
several kinds of series and then we will narrow our focus to particular examples of 
series. 


Definition 6.7 Let Hp C H, C --- C Ay be subgroups of a group G with Hp = 1 
and H, =G. 


1. A subnormal series has the property that H;< Aj. for alli =0,1,...,n—1. 


2. A normal series has the property that H; 1G for alli =0,1,...,n. 


3. An abelian series is a subnormal series with the additional property that 


Z. 


Hy41/H; ts abelian for alli =0,1,...,n—1. 


A central series is a normal series with the additional property that Hj.1/H; C 
Z(G/H;) for alli =0,1,...,n—-1. 


The H; are called the terms in the series and n is called the length of the series. 


Remark 6.3 We make some simple observations. 
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1. Clearly, every normal series is also subnormal (but not vice-versa). 


2. KAHAG does not necessarily imply K 1G, however if K is a characteristic 
in H< 1G, then K AG. 


Example 6.2 Here, we list some examples of series. 


1. The following is a normal series in the abelian group (Z,+) for any positive 
integers a and n: 
10) <2 <a? 2 = ae <= 7, 


For instance, if we take a= 2 andn =4 we get the normal series 


{0} < 16Z < 8Z < MZ, < 27, < ZL. 


These series are certainly abelian and central as well. 


2. The following is a subnormal series in the group D4, the dihedral group: 
{pot S {po ta} S {Po, pa, Ma, M2} < Da. 


We generate certain series which will be used in defining nilpotent and solvable 
groups, but first, we need to define some additional subgroups. 


Definition 6.8 Let G be a group, then 


1. For any natural number n, the n-th derived subgroup, written G™, is de- 
fined recursively as follows: G) =G, GY =G!’ and GY = (GMY’. 


2. For any natural number n, G” is defined recursively as follows: G° = G, 
G! =G! =[G,G] and G*! = [G,G"]. 


3. For any natural number n, the n-th center of G, written Z,(G) is defined 
recursively as follows: Z(G) = 1, 21(G) = Z(G) and Zryi(G) = {g € 
G | [Gg] < Z(G}. 


Remark 6.4 It should be clear to the reader that G°*) aG™ and that G™/Ge+) 
is abelian. Here are some other facts about the n-th derived subgroup. 


Theorem 6.6 For any group G and m and n natural numbers, 

1. G™ is characteristic in G (and so G™ 4G). 

2 (Gm =Ee™., 

3. IfH<G, thn HM < Gm. 
Proof 6.5 We prove the first statement by induction. For the n = 0 case GO = 
G which is certainly characteristic in itself. For n > 0, since G™ = (Gedy is 
characteristic in G’-)) which, by induction, is in turn characteristic in G, it follows 


that G) is characteristic in G (see Exercise 2 in Section 6.1). 
The remaining statements are left to the reader to prove. 
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Definition 6.9 The series, G = GO > GY > G?) > --- is called the derived 
series 


Note that if a derived series terminates in 1 it is then an abelian normal series. 


Example 6.3 S,, has a derived series terminating in 1 when n < 5 (left as an 
exercise). 


We need a lemma before we can prove some facts about G”. The proof of this 
lemma will be omitted since it would require us to first introduce several definitions 
and to prove several results which will take us too far off the beaten path. 


Lemma 6.2 Let H,K,L be subgroups of G. If two of the subgroups |H, Kk, L}, 
[K, L, H], |L,H, K| are contained in a normal subgroup of G, then so is the third. 


Here, we list some facts concerning G”. 


Theorem 6.7 For any group G and n a natural number, 
1. G” is characteristic in G (and so G" 1G). 
DOGO BGG, 

2. (G" Gy Ge™ 
Zp. (Ge)? =e, 


Proof 6.6 We prove the second and third statement and leave the first and fourth 
as exercises. To prove the second statement, since Gr-! < G, it is enough to show 
a single commutator [g,h| € G" (where g € G and hh € G"“') is also in G"!. Now 
[g,h] =h-9h € G"“!, since, by the first statement, G-' dG. 

We prove the third statement by induction on m. The case m = 0 follows 
immediately from the second statement. For m > 0, by Lemma 6.2, [G™,G"| = 
[G"-!,G,G"| is contained in the product [|G,G",G""][G",G""!,G] (G° 4G im- 
plies that [G,G",G—"][G", G"-1, G] is a normal subgroup of G). Now, by induction, 
we know that [G,G",G™][G",G™ 1,4] < qr, 


Definition 6.10 The series, G=G° > G! > G?>.--- is called the lower central 
series 


Note that if a lower central series terminates in 1 it is then (as we shall see) a central 
normal series. 


Example 6.4 If G = B3(F), upper triangular matrices over a field F, then G = 
G° > G! = G' > G? = 1 forms a lower central series. 


Here are some facts concerning Z,,(G): 


Theorem 6.8 For any group G and n a natural number, 
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1. [G, Z,_-1(G)] < Z(G). 

Zm(G/Zn(G)) = Zmtn(G)/Zn(G). 

Z(G) is characteristic in G (and so Z,(G) 1G). 
Z(G) < Zn4i(G). 


ao fe f & 


IG" Z(G) Zale) eS me 


Proof 6.7 We prove the second statement by induction onm. Them = 0 case follows 
immediately from the definition (convince yourself). Form > 0, by induction, 


Zm(G/Zn(G)) = {9 © G/Zn(G) | |G/Zn(G), 9] < Zm—1(G/Zn(G))} 
= {9 € G/Zn(G) | |G/2n(G), g] S Zmtn—1(G)/Zn(G)} 
={9€G | [Gg] < Zm4n-1(G)}/Zn(G) = Zm+n(G@)/Zn(G). 
We leave the rest of the theorem as an exercise. 


Definition 6.11 The series 1 = Z(G) < 21(G) < Z(G) <--- is called the upper 
central series 


Note that if it terminates in G it is then a central normal series. 


Example 6.5 We give several examples of an upper central series. 


1. IfG=Dy4, then 1 = Z(G) < Z(G) = {po, p2} < Z(G) = Dg forms an upper 
central series. 


2. If G= U3(F), then 1 = Z(G) < Z(G) < Zo(G) =G forms an upper central 


series. 


3. G = B3(F) does not form a upper central series, since Z\(G), Z2(G),... are 
each the collection of non-trivial scalar matrices (left as an exercise). 


The next result connects the structures G” and Z,(G) and is the basis for the 
terminology upper and lower. 


Theorem 6.9 For and group G, if for some natural number n we have G" = 1, then 
fort =0, 100.5%) (GP < ZAG). 


Proof 6.8 We prove this by induction oni. For i= 0 the statement to prove becomes 
1 <1 which is evidently true. For 0 <i <n, notice that [G,G"™] = Grtth = 
Gr") which by induction is contained in Z;4(G). But then by the definition of 
Z(G) it follows that G’ < Z(G). 
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EXERCISES 


1 Give an example showing that K < H <G does not necessarily imply K <G. 
2 Prove that if K is a characteristic in H dG, then K dG. 

3 Verify that Example 6.2.2 is an abelian series. 

4 Verify that G+) aG™ and that G™/G@+ is abelian. 

5 Prove Theorem 6.6.2 & .3. 

6 Verify that S;, has a derived series terminating in 1 when n < 5. 

7 Prove Theorem 6.7.1 & .4. 


8 Prove Theorem 6.8.1,.3,.4 & .5 (note that the proof of part 4 follows a similar 
pattern to the proof of part 3 in Theorem 6.7). 


9 Verify for the group G = B3(F’) that 71(G), Z2(G),... are each the collection 
of non-trivial scalar matrices. 


6.3. SOLVABLE AND NILPOTENT GROUPS 


We have now reached the goal of this chapter, namely the study of solvable and 
nilpotent groups. These two classes of groups are a generalization of abelian groups 
in the sense that both can be constructed from abelian groups via a finite number of 
extensions. 


Definition 6.12 A group G is called solvable if it has an abelian series; recall that 
this means there is a subnormal series 1 = Go < G, <--- < G, = G with each 
Giai/G; abelian. The length n for the shortest such series is called the solvability 
class of G (or G is called a solvable group of class n). A solvable group of class 
2 is called a metabelian group. 


Certainly, a solvable group of class 1 is an abelian group. The following result 
reveals the tenacity of solvable groups. 


Theorem 6.10 The following are true: 
1. Every subgroup of a solvable group is solvable. 
2. The image of any homomorphism with solvable domain is solvable. 


&. Every factor group of a solvable group is solvable. 


Proof 6.9 Let G be a solvable group. Thus G has a subnormal series 1 = Go < Gy < 
+++ <G, =G with each Gi41/G; abelian. To prove the first statement, consider the 
Series 

1=HANG)<ANG, <::-<HANG, =H. 
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One can easily verify that the series is subnormal and by the Second Isomorphism 
Theorem, each 


AN Gini /HO G; = (AM Gi41)G;/G; < Gini /Gi- 


In other words, HN Gij41/HOAG is isomorphic to a subgroup of an abelian group 
and is therefore abelian as well. Hence, 1= HNGo< HANG, <:-:--< HAG, =H 
is an abelian series and so H is solvable. 

To prove the second statement, consider the series 


1 = G(1) = (Go) < O(Gi) <--- < o(Gn) = OG). 


Certainly, the subnormal property is preserved by the homomorphism and by the 
First and Third Isomorphism Theorem each quotient 


(Gis1)/O(Gi) = (Gitr/ker(¢))/(Gi/ker(¢)) = Gita /Gi, 


which is assumed to be abelian. Hence, 1 = $(1) = (Go) < (G1) < +--+ < O(Gn) = 
o(G) is an abelian series and so ¢(G) is solvable. 
To prove the third statement, consider the series 


{N} = Go < GiN/N <---<GaN/N =G/N. 


By the Third Isomorphism Theorem (part 1), each G;N/N < Gi4iN/N, since 
GN and N are both normal in Gi41N (check). By the Third Isomorphism Theorem 
(part 2), 

(Giti N/N)/(GiN/N) = GiziN/GiN = Gi41/Gi, 
which we know to be abelian. Hence, {N} = Go < GIN/N <---<G,N/N = G/N 
is an abelian series for G/N and so G/N is solvable. 


Definition 6.13 A group G is called solvable if it has an abelian series; recall that 
this means there is a subnormal series 1 = Gop < Gi <--- < G, = G with each 
Giii/G; abelian. The length n for the shortest such series is called the solvability 
class of G (or G is called a solvable group of class n). A solvable group of class 
2 is called a metabelian group. 


Definition 6.14 A group G is called nilpotent if it has a normal series 1 = Go < 
Gi <-++ < Gz = G with each Gi41/G; < Z(G/G;). The length n for the shortest 
such series is called the nilpotency class of G (or G is called a nilpotent group 
of class n). 


Certainly, a nilpotent group of class 1 is an abelian group. Nilpotent groups have 
a similar tenacious result. Since the proof is very similar in nature to the proof just 
given, we leave it to the reader as an exercise to provide the proof. 


Theorem 6.11 The following are true: 


1. Every subgroup of a nilpotent group is nilpotent. 
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2. The image of any homomorphism with nilpotent domain is nilpotent. 


Remark 6.5 We need to point out that not every solvable group is nilpotent. Take 
for example the group G = S3 which is solvable (since it has a derived series — see 
Exercise 6 of Section 6.2), yet is certainly not nilpotent. Since, Z(G) = 1, then 
for all i, by Thereom 6.8.2, Z(G/G;) = Z(G)/Z;(G) = 1. Therefore, for all i, we 
cannot possible have Gi41/G; < Z(G/G;) = 1. Yet it is certainly the case that every 
nilpotent group is solvable (convince yourself). In fact, we will see later that if a group 
is nilpotent of class n, then it is also solvable of class <n. 


Lemma 6.3 If G is a p-group, then G is nilpotent. 


Proof 6.10 We prove this result by induction. Assume every p-group of order less 
than the order of G is nilpotent. Since Z(G) 4 1, this implies G/Z(G) is p-group of 
smaller order than G and therefore is nilpotent. Consider the canonical homomor- 
phism v : G— G/Z(G). The the preimages of the central series of G/Z(G) form a 
central series for G, thus making G nilpotent. 


We wish now to connect the notions of solvability and nilpotency with some of 
the specific series we defined in the previous lesson. This will make the study of these 
classes of groups more concrete in the sense that we can prove more results about 
these groups, in particular about their solvable and nilpotent classes. 


Theorem 6.12 For a group G, 


1. If G has an abelian series 1 = Go < Gi < ++: < Gn = G then for alli = 
0,1,...,n we have GO ..2, 


2. G is solvable of classn iff G™ =1 and G°-) #1, for some n. 


&. If G has a central series 1 = Go < Gi < +--+ < Gy = G then for alli = 
0,1,...,n we have G; < Z;(G). 


4. G is nilpotent of class n iff Z,(G) = G and Z,_\(G) 4 G, for some n. 


Proof 6.11 We prove the first statement by induction. For 1 = 0, the statement is 
self evident. For i > 0, by induction, G@ = (GY)! < (Gz-@1)’ S Gui, since 
Gr—(i-1)/Gn—i ts abelian (see Theorem 10.9.2). 

To prove the second statement, first assume that G is solvable. So then G has an 
abelian series 1 = Go < Gy < +--+ < Gy = G and let this be the shortest such. By part 
1, it follows then that G™ < Go = 1. Furthermore, G"-) 1, for otherwise the 
derived series would be shorter than our assumed shortest abelian series witnessing 
the solvability of G. Now assume that G™ = 1 and G°-) ¥ 1, for some n. Then 
the derived series G = GD > GY > G2) >... > G™ = 1 is an abelian series 
witnessing the solvability of G. By the direction just proved and the fact that G@"-) F 
1, it must be the case that the derived series is the shortest abelian series for G 
(convince yourself) and hence n is the solvability class for G. 

We leave the third and fourth statements to the reader to prove in an analogous 
manner to the first and second. 
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Example 6.6 From the examples presented in the last section, we know then that 
1. S, is solvable for n < 5. 


2. We will show later in the text that for primes p,q,r if G is a group of order p™ 
or pq or p*q? or pgr, then G is solvable. 


3. Burnside proved a more general fact that if G has order p™q”, then G is solvable. 
The most general fact written in an amazing article of incredible length by Feit 
and Thomas is that any group of odd order is necessarily solvable. 


4. The first four smallest non-solvable groups have orders 60, 120, 168 and 180. 
The non-solvable group of order 60 = 2? -3+5 is As. 


5. B3(F) is nilpotent of class 2. In general, B,(F’) is nilpotent of class n — 1. 
6. D4 is nilpotent of class 2. 
7. U3(F) is nilpotent of class 2. 
Theorem 6.13 [f G is a nilpotent group of class n, then G is solvable of class <n. 


Proof 6.12 We have already pointed out that nilpotency implies solvability, so it 
remains to show the statement regarding the class. So in addition, assuming G has 
a central series of shortest length n gives us an abelian series of length n which may 
or not be the shortest such. Hence, the best we can conclude is that G is solvable of 
class< n. 


These next series of results put the subgroup G” in proper perspective. 


Lemma 6.4 If G is a nilpotent group of class n, then G? < Z,_;(G) for i = 
O,1,...,n. 


Proof 6.13 The proof is by induction. The i = 0 case follows immediately from 
Theorem 6.12.4. Forti > 0, using induction and Theorem 6.8.1, 


Gi =[G,G™"] < [G, Z,-@-1)(@)] = Zn-G-2(@) S$ Zn-i(@). 
Lemma 6.5 If G" =1, then G is nilpotent of class <n. 


Proof 6.14 By assumption and Theorem 6.9, we know that G?-' < Z(G) fori = 
0,1,...,n. So if we seti=n we get G=G° < Z,(G). Hence, Z,(G) = G and so by 
Theorem 6.12.4, G is nilpotent of class <n. 


Theorem 6.14 A group G is nilpotent of class n iff G? =1 andG"-!¥41. 


Proof 6.15 Assuming G is nilpotent of class n, by Lemma 6.4, G" < Z(G) = 1 
and so G" = 1. Furthermore, G’-! 4 1 for otherwise, by Lemma 6.5, G would be 
nilpotent of class <n—1, a contradiction. 

Assuming that G’ = 1 and G"-! £1, by Lemma 6.4, G is nilpotent of class <n. 
Now G cannot be nilpotent of class i <n, for otherwise, by the reverse direction just 
proved we would have G’ = 1, contradicting the fact that G’-! £1. 
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EXERCISES 


1 Prove Theorem 6.11 


2 Prove that every nilpotent group is solvable. 


3 Prove Theorem 6.12.3 & .4. 


4 Determine the solvability class of S, for n < 5. 


5 Verify the following: 


a. 


b. 


B3(F’) is nilpotent of class 2. 
In general, B,,(F’) is nilpotent of class n — 1. 
Dg is nilpotent of class 2. 


U3(F) is nilpotent of class 2. 
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Ring Theory 


N THIS CHAPTER, we now study a set with two operations as opposed to groups 
| which have one operation. This, of course, makes the study both more complicated 
and more rich not only because of the second operation added, but also because of the 
interaction between the two operations. As we shall see, rings have many structures 
and concepts similar to groups such as subring, ring homomorphism and quotient 
ring and ideals (as opposed to normal subgroups). 


7.1. DEFINITION AND EXAMPLES 


We now begin the study of rings by presenting basic definitions and lots of examples 
to back up these definitions. 


Definition 7.1 A ring is a set R together with two operations which we shall denote 
by the symbols of + and - which satisfy the following properties: 


1. (R,+), denoted by R*, forms an abelian group. 

2. (R,-) satisfies the closure and associative properties of a group. 
3. (R,+,-) satisfies the distributive property, namely for all r,s,t € R we have 
r-(s+t)=r-s+r-tand(r+s)-t=r-t+s-t. 


A commutative ring is a ring in which in addition (R,-) satisfies the commu- 
tative property. A ring with unity is a ring in which in addition (R,-) satisfies the 
identity property of a group. 

We will denote this unity in a ring with unity by 1 and we sometimes call this 
ring a ring with 1. One can show as we did for groups that this 1 is unique when it 
exists. We denote the additive identity by 0 and the additive inverse of a ring element 
r by the notation —r. 


Example 7.1 Here, we list some examples of rings. The ring properties are easily 
verifiable. 


1. Z,Q, R and C with + and - being the familiar addition and multiplication each 
form a commutative ring with unity. 
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Figure 7.1 The addition and multiplication table for Z,. 


2. Let Zp = {0,1,...,n —1} with addition and multiplication modulo n. Then Z 
is a commutative ring with 1. For instance, Figure 7.1 present the addition and 
multiplication tables for Zq: 


8. For an integer n consider nZ, all integer multiples of n with the same operations 
as Z. Then forn #0 we have a commutative ring without unity. 


4. Let M,(R) be the collection of nxn matrices with entries from some ring R and 
define matrix addition and matrix multiplication in the usual linear algebraic 
way making use of the operations from the ring R. Then M,(R) is a ring (non 
commutative) with unity (when R has unity) and is called a matrix ring. 


5. Let R\x| be the collection of polynomials whose coefficients come from a ring 
R and define polynomial addition and multiplication in the usual algebraic way 
making use of the operations from the ring R. Then R[x] is a ring called a 
polynomial ring. 


6. Let F(X, R) be the collection of functions from a set X to a ring R and define 
addition and multiplication of functions in the usual way, te. (f + g)(x) = 
f(x)+9(2) and (f-g)(x) = f(x)-g(x) using the operations in the ring R. Then 
F(X, R) is a ring which is commutative and with unity if R also has those 
properties. 


7. Let (G,+) be an abelian group and let End(G) consist of all the endomorphisms 
of G. Define addition and multiplication as follows: 


(6+ ¥%)(9) = (9) + V(9) (@-¥)(9) = O(V(g))- 


The End(G) is a ring with unity (non commutative) where the unity is the 
identity map on G, t.e. 1g, called the ring of endomorphisms of G. 


&. Given rings Ry, Ro,... Ry we can define the cartesian product ring 
Ry x Rp x--+x Ry 


similar to the construction for groups where operations are defined coordinate- 
wise. 
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9. The ring R = {0,1} is called the trivial ring and R = {0} is called the zero 
ring. 


Definition 7.2 Let R be a ring with 1. An element r € R is a unit if it has a 
multiplicative inverse, i.e. there exists ans € R such thatrs =1= sr. The collection 


of all units in R are denoted by U(R). 


Remark 7.1 When r is a unit one can show in the usual way that its inverse is 
unique, so from now on we will denote the inverse of r by the suggestive notation of 
reciprocal, r~'. We leave it as an exercise to show that (U(R),-) forms a group. 


Example 7.2 In each of the examples below it is easy to compute U(R). 
1. U(Z) = {-1, 1}. 


2. U(Q) = M, U(R) = R* and U(C) = C* 

4. U(End(G)) = Aut(G). 

5. U(Z,) ={meZ, : gcd(m,n) = 1}. 

Using the multiplicative operation in a ring we can define exponentiation in the 
natural way. For r € R and n a positive integer, we define r™ =r-r----- r (n times). 
If R has unity, then r° = 1. Ifr isa unit, then r~" =r-t-r-t..--. r—! (n times). We 


define an additive exponentiation using the additive operation in the ring. In other 
words for r € R and n a positive integer, we define nr = r+r+---+r (n times), 
Or = 0 and (—n)r = (—r) + (—r) +--- + (1) (n times). The standard properties of 
exponentiation hold here, namely, for r € Ra ring a m,n € Z, we have 


qe Ge eae mr+nr =(m+n)r n(mr) = (nm)r. 
Lemma 7.1 Letr ands be elements of a ring R. Then 


1. r0=0=0r. 


T 
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Proof 7.1 We leave the proof of these statements as a nice exercise. 


Definition 7.3 Let (R,+,-) be a ring. A non-empty subset of S C R is a subring 
of R, written S < R, if (S,+,-) is a ring (using the same two operations). 
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Remark 7.2 As with groups and subgroups, there is a short cut for determining 
when a non-empty subset is a subring. One simply shows that for allr,s € R we have 
r—s€Randrs € R. The proof of this remark follows quickly from the one given 
for subgroups. 


Example 7.3 Here, we list several examples of subrings. 


1. For any integer n we havenZ<Z<Q<R<C. For instance, to see that 
nZ < Z, using the shortcut, for nk,nl € nZ, we have 


nk—nl=n(k-D E€nZ and nk-nl =n(knl) € nZ. 


Since (nZ, +) are the only subgroups of (Z,+) it follows that (nZ,+,-) are the 
only subrings of (Z,+,-) 


2. Differentiable real-valued functions are a subring of continuous real-valued func- 
tions which are a subring of real-valued functions. The notation we will use for 
these rings are 


D(R) < C(R) < F(R). 
8. For any ring R, the center of R, written 
Z(R)={reR : rs=sr for alls € R}. 


For instance, one can show that Z(M,(R)) = {rln r € R}, te. scalar 
matrices. We leave the verification that Z(R) < R as an exercise. 


Definition 7.4 R is called a division ring if R is a ring with 1 4 0 and U(R) = R*. 
[f in addition R is commutative, then R is called a field otherwise it is called a skew- 
field. 

Analogous to subring, A non-empty subset S of a field (R,+,-) is a subfield if 
(S,+,-) is a field (with the same operations). 


Remark 7.3 We list a couple of remarks about fields. 


1. In other words, (R,+,-) is a field if both (R,+) and (R*,-) are abelian groups 
and R has the distributive property. 


2. The shortcut for checking S is a subfield of R is as follows: Show that for all 
r,s€S wehaver—seS andrs'eS. 


8. It is not an easy matter to construct a skew-field. In fact, it will be shown that 
a finite division ring is always a field. In a later section, we will construct the 
quaternions which is an example of an infinite skew-field. 


4. Q, R, C are examples of infinite fields and Z, for any prime p are examples 
of finite fields. Later we will see that the only finite fields ones of prime power 
order. 
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EXERCISES 


1 Verify that each of the rings presented in Example 7.1 are indeed rings. 
2 R=Z[V2]. Prove that m+n/2 €U(R) iff m?—2n? =+1 


3 For any ring R, prove that (U(R),-) forms a group. 


4 Let R be a ring with 0 4 1 and fix an rr € R. Suppose there is a unique s € R 
such that rs = 1. 


a. Show that r cannot be a zero divisor. 


b. Show that r must be a unit. 


5 Verify that the following subset of M/2(R) is a subring: 


6 Ra commutative ring such that a? = a for all a € R. Prove a+ a = 0 for all 
ace R. 


7 X aset and R= P(X) with A+ B=(AUB)—-(ANB) and A-B=ANB. 
a. Prove that R is a commutative ring. 
b. Write out the addition and multiplication table for R when X = {a,b}. 

8 Prove all parts of Lemma 7.1. 

9 Verify Remark 7.2. 


10 For any ring R, prove that Z(R) < R. 


11 Let R bea ring with unity. Prove that risa unit iff ds € R such that rsr =r 
and sr?s = 1. 


7.2 INTEGRAL DOMAINS 


The generic example of a ring from which perhaps the general definition arose is the 
integers. An additional property which the integers have is that the only way for a 
product of two integers to equal zero is if one of the factors is zero. Now this is a very 
useful property, for instance when solving a factorable polynomial equation such as 
x? —x—2=0 from basic algebra. One first factors it as (x — 2)(x +1) = 0 and since 
the product is zero it must be the case that either « —2 = 0 or +1 =0. Thus x = 2 
or x = —1 and we’ve solved the quadratic equation. In this section, we will generalize 


this property of the integers. 


Definition 7.5 An element r in a commutative ring R with 1 4 0 is called a zero 
divisor ifr £0 and there exists an s £0 in R such that rs = 0. 


170 m@ Fundamentals of Abstract Algebra 


Remark 7.4 Equivalently, r in a commutative ring R with 1 4 0 is not a zero 
divisor if whenever rs = 0 for some s € R, then either r = 0 or s = 0. This 
restatement of the definition in the negation can come in handy for proofs involving 
zero divisors. 


Example 7.4 In the ring Ze the number 2 is a zero divisor, since 2-3 = 0. Likewise 
8 is also a zero divisor. 


Definition 7.6 A ring is called an integral domain (which we shall abbreviate as 
ID) if 


1. R is commutative with 1 4 0, and 


2. R has no zero divisors. 


Example 7.5 Here, we list some examples on the topic of integral domains. 
1. The ring Ze is not an ID since it has zero divisors, such as 2 and 3. 


2. Z, Q, R, C and Z, (p prime) are all ID’s. 


Lemma 7.2 Let R be a commutative ring with 1 4 0. 
1. No unit in R can be a zero divisor. 


2. If R is a field, then R is an ID. 


Proof 7.2 Let r € R be a unit and suppose that rs = 0. Since r— exists we can 
multiply both sides of the equation by r~! to get r-*(rs) = r~10 which simplifies to 
s=0 and sor is not a zero divisor. 

If R is a field, then every non-zero element in R is a unit and hence R cannot 
have any zero divisors by the first part of this lemma. 


The converse of the second part of Lemma 7.2 is false, i.e. not every ID is a field. 
Take, for instance, the integers which is an ID but is not a field. However, the result 
is true under the assumption that the field is finite. 


Lemma 7.3 Every finite ID is a field. 


Proof 7.3 Let R be a finite ID. It’s enough to show that every non-zero element in 
R is a unit. Take r € R* and consider the map f : R* + R* by f(s) = rs. This 
function maps into R*, since s #0 implies that f(s) =rs #0. This function is also 
one-to-one, for suppose that f(s) = f(t) for some s,t € R*. Then rs = rt and so 
r(s —t) =0. Now r £0 and since R has no zero divisors it must be the case that 
s—t=0, te. s =t. Since R* is finite, by Lemma 1.2.2, f also maps onto R*. In 
particular, there exists an s € R* such that f(s) =1, i.e. rs =1 and sor is a unit. 


Definition 7.7 A non-empty subset S of an ID (R,+,-) is asubdomain if (S, +, -) 
is an ID. 
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TABLE 7.1. V means it has that property and x 
means it doesn’t 


commute - 
distribute - | Vv 


a 
2 
= = 
ay a 
g S| 2 
ec uPe ie alee 
EISlelels 
alFls|/ fia 
w/e} oo) eg| 42) e/% 
he IS Wwe (co? | ae ee 
Axiom SlOlmIOUlAlwW]S& 
{ closure+ [Vv |v [Vv |v |v flv fv 
[| assoc + “ARARATARARAR, 
| identity ++ [Vv [|v |v jv |v iv iv 
| inverse+ [V [Vv |v |v |v [viv 
[commute+|]vV [Vv |v |v iv [viv 
closure - Vivivilvl|lvl[vlv 
assoc - Vivivilv|vl[vilv 
identity - Viv|[Vv|vlv 
Vivilv 
v x |v 
ViviVviv 


| 
| inverse - 
| 
| 


“JS 
aN 


Remark 7.5 One can show that the shortcut for verifying subdomain is to show 
1é€S and for alla,b€ S botha—beES andabe S. 


Example 7.6 Here are some examples that relate to subdomains. 
1. The integers Z is a subdomain of the rationals Q since 1 € Z. 


2. Forn #1 the subring nZ of Z is not a subdomain of Z since 1 ¢ nZ. 


We end this section with a summary of the structures we have seen in this chapter 
thus far and the properties they each have (Table 7.1). 


EXERCISES 


1 Let R= {m+ 2ni : m,n © Z}. Verify that R is an ID by showing it’s a 
subdomain of Z[i]. 


2 Prove that F(X, R) is not in general an integral domain. 


3 Given an ID R, show that a non-empty subset S C R is a subdomain iff 1 € S 
and for alla,b€ S botha—be Sandabe S. 


4 Prove that given S is a subring of an integral domain R, if S has unity, then it 
must be the same unity that R has. 
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5 How many elements in an integral domain have the property that a? = a? 


6 Let R be a ring with |R| > 2 and for all a € Rx there exists a unique b € R 
such that aba = a. Prove that R must be a division ring. 


7.3. THE QUATERNIONS 


In this section, we carefully examine one example of a (necessarily infinite) skew- 
field called the quaternions discovered by the Irish mathematician William Rowan 
Hamilton. As the story goes he was so excited over his discover that he etched in a 
bridge he was crossing the basic relations which define this skewfield. 

We will give two different presentations of this skewfield, since each of the two 
presentations has its benefits. The first representation is matrix based, while the 
second is a more formal representation. 


Definition 7.8 The following set of matrices with the usual matrix addition and 
matrix multiplication is called the quaternions: 


a-{[-a 5] ss9ee} 


Theorem 7.1 The quaternions form a skewfield. 


Proof 7.4 Since Q C M2(C), we first check that Q is a subring of Mo(C). Set 


A=| A | ond B=| 2 a 
—-W1 41 —W2 22 


Then 
41 — 42 W1 — W2 41 — 22 Wi — W2 
A-B= a EQ and 
—W1+W2 21-22 —(wiy—we) 2% % 
2122 — W1W2 ZyW2 + W122 ] | 2122 — W1W2) We + W122 
AB= = EQ. 
—22W, — Z1W2 —W1W24 2122 —(zW2 + W122) 2122 — Wi We 


Q has unity, since 
1 0 
Ig = 5 a EQ. 


Every non-zero matriz in Q is a unit, since for any non-zero matrix 
zw a+bi c+d 
A — es = — 
—W Z 


—c+di a-—bi EQ, 
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the determinant is non-zero. Indeed, |A| = 22+ Ww = a? + b? +c? +d? > 0 when 
A#0 and 


1 _ J Z/A| -w/lAl ] _ 2/|A| —w/Al 
-| aA “Hal |= | cata aia [€2 


Finally, Q is non-commutative, since for instance one can check the following two 
matrices in Q do not commute: 


i 0 0 1 
Ee a ond | 4 A 
Hence, Q forms a skewfield. 


We now present an alternate more formal definition of the quaternions. Set 7, 7 
and k to be new symbols and define 


Q={at+bi+cj+dk : a,b,c,d€ R}. 
Addition will be defined component-wise, i.e. 
(a, +biiterjt+dik)+(a2+b2i+c2j+d2k) = (a1 +a2)+(b1+b2)i+(e1+¢e2)j +(di +da)k. 
In order to define multiplication, we first establish the following relations: 
P=fP=kh=-1, ij=-ji=k, jk=—kj =i and ki=—-ik =j. 


Indeed, these are the very relations which helped define the groups of quaternions. 
A mnemonic for remembering some of the relations is to consider 7, 7 and k as the unit 
vectors in 3-space and multiplication is cross product and use the right-hand rule. 
Using these relations, we can then define multiplication in Q by formal distribution 
and addition as defined above. 


Example 7.7 We will illustrate the two operations in Q by a couple of examples. 


1. We can add two elements of Q, 


(24+ 5—k)+(3i—k) =24+3i+ 5 -2k 


2. We multiply two elements of Q, 


(2+ 5—k)-(3i—k) = 6i — 2k 437i — jk — 3ki+ k* 


= 6 — 2k —3k —i—37 -1 = -14 5i— 37 —5k. 


Now although this representation of Q is easier to manipulate, the verification 
that this representation of Q is a skewfield requires more work since it does not sit 
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in any familiar setting having been formally defined (and we will omit this proof). 
Finally, the correspondence between the two representations is given as follows: 


_|a 0 oe bi 0 0 ¢ i 0 di 
| 0 a 0 —bhi —c 0 di 0 
1 O z 0 0 1 0 2 : 
=e]; fe = Joa[! | crertirer sat 
In other words, 
| <—> ae! } <> — d k++ os 
: 6 aay? = aa i 0|- 


The reader may wish to check that the basic relations for 7, 7 and k are satisfied 
for these three matrices. 


at+bt c+di 
—-ct+tdi a-—bi 


Definition 7.9 The following subset of the quaternions is called the Hamiltonian 
integers: 
H={at+bi+cej+dk : a,b,c,deEZ }. 


Remark 7.6 We list a few facts about the Hamiltonian integers. 


1. One can easily see that H is a subring of Q. 


2. The units of H, which we denote by Qg = {£1, +i, 47, +k} and is called the 
quaternion group. To see that these eight elements of H are indeed the units 
of H, first note that certainly each of the eight elements is a unit. For instance, 
i-| = —i. Now suppose that A = a+bi+cj+dk € U(H) (we switch now to the 
matrix representation of H). Then there is a B € H such that AB = Ig. Then 


1 = |I2| = |AB| = |A||B] = (e? +0? + 2 +.d?)|Bl. 


Hence, a? +b? +c?+d? =1. Since a,b,c,d € Z, this implies that either a = +1 
andb=c=d=0, b=+1 anda=c=d=0, c=+l anda=b=d= 0), 
ord=+1 anda=b=c=0. These eight cases correspond precisely the eight 
elements listed in Qs. 


We end this section with a diagram illustrating all the structures we’ve seen so 
far including examples for each type (Figure 7.2). 


EXERCISES 


1 Check the following two matrices in the quaternions do not commute: 


fot} [ao] 
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Ring 
with unity Division Ring 
Skewfield 


quaternions 


Integral Domain =z 


Commutative 


Figure 7.2 The world of rings. 


2 Consider the quaternions Q = {a+ bi+cj + dk | a,b,c,d € R}. Describe the 
elements of Q which commute with i (under multiplication). 


3 Verify that the center of the quaternions consists of real scalar matrices, i.e. 
Z(Q) = {alg : aE R}. 


4 If we set 


Verify the following relations: 


Po=fP=hR=-1, ij=—-ji=k, jk=—kj =i and ki=—ik =}. 


5 Verify that the Hamiltonian integers form a subring of the quaternions. 


7.4. RING HOMOMORPHISMS 


Just as we had a notion of a group homomorphisms we now define a ring homo- 
morphism. The parallels between the two algebraic structures groups and rings with 
regards to homomorphisms are extensive. 
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Definition 7.10 Let (R,+,-) and (R’,+’,-’) be two rings. A function @: R—- R' is 
a ring homomorphism jf for all r,s € R, we have 


o(r +s) = G(r) +' G(s) and o(r-s) = d(r) “ (s). 
Example 7.8 We list here several examples of ring homomorphisms. 


1. Consider the ring C and the subring of M2(IR) defined by 


The map 6: C > R by 


(ax= Bs al 


a 


is a ring homomorphism, since 


Gl(a + bi) + (c+ di)] = plato) + (b+ ai] = 04a) a 
= ae zs i : = o(at bi) + d(c + di). 


G[(a + bi)(c + di)] = d[(ac — bd) + (ad + be)i] = Beer aay 


a b c d 
-| 5 le (| = ote + tate ao, 


2. The map ¢:Z—- Z, by o(m) =r, wherem=ngq+r and0<r<n, is a ring 
homomorphism. 


3. Let R and R’ be rings, and choose anr € R. The map ¢: F(R, RB’) > R’ by 
o(f) = f(r) is a ring homomorphism and is called the evaluation homomor- 
phism. 


4. We have the usual identity ring homomorphism, 1p: R— R by 1r(r) =r for 
allr € R and the trivial (or zero) ring homomorphism Opp : R - R' defined 
by d(r) = 0! for all r € R, where 0! is the additive identity in the ring R’. 


Definition 7.11 Let ¢: R— R' be a ring homomorphism. The kernel of ¢, written 


ker d={reR: d(r)=0'}. 
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Example 7.9 Here, we discuss the kernel of specific examples of ring homomor- 
phisms. 


1. For the map given above in Example 7.8.1, one can check that the kernel is 
trivial, i.e. ker @ = {0}. 


2. For the map given above 6: Z— Zn, the kernel is nZ. 


3. The kernel of the evaluation homomorphism is the collection of all functions 
which take on the value 0 at r. 


4. kerlpz = {0} and kerOrpe = R. 


The next result lists a few immediate consequences of the definitions given thus 
far in this section. Some can be verified by appealing to group theoretic results we 
have already proved. 


Lemma 7.4 Let ¢: R > R' be a ring homomorphism andr € R. 
1. (0) =0' and ¢(-r) = —d(r). 
2. ker is a subring of R. 
3. @ is one-to-one iff ker d= {0}. 
4. O(R) is a subring of R’. 
Definition 7.12 Let 6: R— R' be a ring homomorphism. 
1. If b is one-to-one, then it is called a monomorphism. 
2. If 6 maps onto R’, then it is called a epimorphism. 


3. If @ is one-to-one and maps onto R’, then it is called a isomorphism and we 
say the rings R and R' are isomorphic and we write R& R’. 


Example 7.10 Let’s revisit Example 7.8. 


1. For the map defined above ¢: C > R we have already computed ker ¢ = {0} and 
so by Lemma 7.4 @ is one-to-one. Furthermore, it is clear from the definition 
of @ that it maps onto R. Hence, ¢ is an isomorphism and C = R. 


2. For the map ¢: Z— Zp we have seen the kernel is non-trivial and so ¢ is not 
one-to-one. However, it is immediate from the definition of d that it maps onto 
Zn, and so d is an epimorphism. 


3. 1p is an isomorphism and ORR ts simply a homomorphism in general. 
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EXERCISES 


1 Verify that Example 7.8.2,.3,.4 are indeed ring homomorphisms. 
2 Verify that the kernel of Example 7.8.1 is trivial. 

3 Prove the statements listed in Lemma 7.4. 

4 Prove the only ring automorphism of Z is the identity map. 


5 If Ris a commutative ring with unity and D is an integral domain and @: R > 
D is a ring homomorphism not the zero map, then ¢(1) = 1. 


6 Let 6: R—- R' and yw: R’ > R" be two ring homomorphisms. 


a. If ¢ is an isomorphism, then so is ¢-1. 


b. wood is a homomorphism. 


7.5 FACTOR RINGS AND IDEALS 


Analogous to the concept of a factor group and a normal subgroup in group theory 
are the factor ring and ideal subring in ring theory. Let (R,+,-) be a ring and Sa 
subring of R. The elements of the factor ring will be cosets of R modulo S$ using 
the additive structure of the ring (R,+). In other words, R/S ={r+S : se R}. 
We remind the reader of some elementary facts about cosets that are used again and 
again. 


1. ry tS=7re+S iff m1 €re+S iff Try —T2 € S. 
2.r+S=S8 iff res. 


We would like to define the ring operations for this new structure representative- 
wise, that is, 


(71 + S) + (r2 +: S) = (r1 +172) + S and (71, + S)- (r2 +: S) = (11-72) +S. 


As the reader will remember, the problem which can arise with these coset oper- 
ations is that they may not be well-defined functions (from R/S x R/S to R/S). If 
we can get beyond this hurdle, then R/S can be easily seen to form a ring which we 
call the factor or quotient ring of R modulo S. The additive identity in this ring 
will be S and should R have unity, then R/S will have unity 1+ S. 

Recall that the cosets of a group modulo a subgroup had a well-defined operation 
iff the subgroup was normal. Now, since (R, +) is an abelian group, we know (S,+) isa 
normal subgroup and so coset addition in R/S is always well-defined. We now present 
the condition which guarantees that coset multiplication in R/S is well-defined. 


Definition 7.13 A subring I of R is called an ideal, written I< R, if forallre R 
anda€ I, we have ra,ar € I. 
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+ 3Z 14+3Z | 2+3Z . 3Z 14+3Z | 24+3Z 


3Z 3Z 
1+ 3Z 1+ 3Z 
2+ 3Z 2+3Z 


Figure 7.3 The addition and multiplication table for Z/3Z. 


Lemma 7.5 Given a ring R and subring I of R. Coset multiplication for R/I is 
well-defined iff IQR. 


Proof 7.5 Assume first that I< R. Supposer +I =r'+I ands+I=s'+TI. Then, 
rer+tands'€s+TI and sor’ =r+aands'=s+b for somea,be€I. We want 
to show (r+ I)\(s+1I)=(r +1)(s' + J), te. rs+I=r's' +1 orr's' €rs+I. Now 
r's' = (r+a)(st+6) =rs+rb+as+ab and since I< R we know rb,as,ab € I. Thus, 
rb+as+abeTl and sors+rb+as+abers+TI. 

Now we assume that coset multiplication is well-defined. Fora € I andre R 
note that 


ra€rat+lI=(r+D(eat+D=(r+ DI =(r+HN04+)=7r04+7=04+7=1. 
Similarly, one can show ar € I and sold R. 


Corollary 7.1 Let R be a ring with subring I. The cosets of R/I with coset addition 
and multiplication forms a ring iff I< R. 


Proof 7.6 Most of the work has already been done and we leave the details to the 
reader as an exercise 


Example 7.11 Consider the ring Z and the subring 3Z. One can check that 3Z is an 
ideal and so we may consider the factor ring Z/3Z. The addition and multiplication 
table for this ring are given in Figure 7.3. 

It is no coincidence that these tables look remarkably similar to the tables for the 
ring Z3. Later in the text we shall show they are isomorphic as rings. 


Remark 7.7 The shortcut for checking that I< R is to check for alla,b € I we have 
a—bé€T1 and for allr € R we have ar,ra € I (or just ar € I if R is commutative). 
The reader should verify this as an exercise. 


180 m Fundamentals of Abstract Algebra 


Example 7.12 We now give various examples of ideals. 


1. nZ<Z (exercise). Recall from earlier that the only subrings of Z are subrings 
of the form nZ, and thus they are also the only ideals in Z. 


2. Consider the ring of functions F(X, R) from any set X to a ring R. Fix an 
xo € X and set I ={f € F(X, R) : f(xo) = 0}. Then Ia F(X, R), since if 
fig € I, then (f — g)(xo) = f(x) — g(a0) = 0-0 =0 and if h is any function 
from X to R, then (fh)(xo) = f(xo)h(xo) = 0+ h(xo) = 0 and in like manner 
(hf)(x0) = 0. 


3. Aut(G) < End(G) (exercise). 


4. Let R= M2(R) and I= {| 


for although 
w x a OO} _ 
yz ob 05| >" 


a 0 w 2 ii ties 
E ale lel ye |e! when x # 0. 


5. Ifé: R= R' be a ring homomorphism, then ker ¢ <1 R (exercise). 


| : a,bEeR \ Now I is not an ideal of R, 


aw+bx 0 
ay+bz 0 


| El, however 


6. Let R be a commutative ring and fix ana € R. ThenI = {ra : r € R} is an 
ideal in R called the principal ideal generated by a. The notation employed 
for such as ideal is I = (a) or I = Ra. Note that in Z all ideals are principal 
ideals, since nZ = (n) and as we pointed out these are the only ideals in Z. 


7. Let X be a subset of a commutative ring R. The ideal generated by X, written 
I = (X) is the smallest ideal containing the set X. If X = {r1,r2,...,1n} is 
finite then we write I = (r1,1r2,...,Tn). Note that the ideal generated by a single 
element of R is a principal ideal. 


8. For any ring R, I = {0} ts called the trivial ideal and R is called the im- 
proper ideal. 


Remark 7.8 We make several remarks many of which are left as exercises to verify. 


1. If Ris a ring with 1 and I is an ideal of R containing 1, then I = R. Indeed, 
for anyr © R wehaver=riel. 


2. If Ris a ring with 1 and I is an ideal of R containing a unit, then I = R, for 
suppose r € I is a unit. Then r~ exists and so 1 = r~'r € I, and by above 


I=R. 


3. A field has no proper non-trivial ideals. 


Ring Theory m 181 


4. Any ring homomorphism between two fields is either the zero map or a 


monomorphism. 


5. For any ideal I in a ring R, the canonical mapv: R—> R/I by v(r)=r+1 


is a ring homomorphism with kernel equal to I. 


6. A subset X of a ring R is an ideal iff X is the kernel of some ring homomor- 


phism with domain R. 


Theorem 7.2 (Fundamental Theorem of Ring Homomorphisms) Let ¢ 
R—- R' be a ring homomorphism and set K = ker ¢. Then R/K = @(R) and if 
@ is an epimorphism, then R/K & R’. 


Proof 7.7 Analogous to the case for groups, one now shows that the map UV: R/K > 
@(R) by U(r+ K) = G(r) ts a ring isomorphism. 


Example 7.13 Recall the epimorphism @: Z — Z, in Example 7.8.2. We found the 
kernel to be nZ and so by FTH, Z, = Z/nZ as rings just as they were as additive 


groups. 


EXERCISES 


1 


2 


Prove Corollary 7.1. 
Prove in the ring (Z,+,-) that nZ <4 Z for any integer n. 


Prove that J < R iff for all a,b € I we have a —b € J and for all r € R we have 
ar,ra Ee I. 


Prove that if @: R > R’ be a ring homomorphism, then ker ¢ < R. 


Prove that End(G) < Aut(G). 


Show for r1,72,.-.,% € Ra commutative ring that I = (r1,1r2,...,7n) can be 
written as 

{syry + Sorg +--+ +SnTn 2 $1, $2,---,8n € R}. 
Fix a prime p and set R= {f : p{b}andI={F ER : pla}. 


a. Prove that F is a ring. 

b. Verify that J < R. 

c. Verify that 6: R— Z, by $(¢) = ab~'(mod p) is a ring epimorphism. 
d. Compute ker®@. 

e. Apply the FTH to parts c and d. 


Prove if J, J< R a commutative ring, then [NJ <1 R. 


182 


10 


11 
12 
13 


14 


15 


16 


17 


18 


19 


20 


21 
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Let R be a commutative ring and a € R. Define J = {r € R : ra =0}. Prove 
Tae. 


Consider the ideals J, J <1 R a ring and define 
IJ = {a,b + agbg + +++ + Gnbp | each a; € I, each bj € J, n € Z*°}. 
Prove that IJ < R. 
Verify that a principal ideal is indeed an ideal. 
Verify that the trivial ideal is indeed an ideal. 
A field has no proper non-trivial ideals. 


Any ring homomorphism between two fields is either the zero map or a 
monomorphism. 


For any ideal J in a ring R, the canonical map v: R > R/I by vir) =r+1 
is a ring homomorphism with kernel equal to I. 


A subset X of a ring Ris an ideal iff X is the kernel of some ring homomorphism 
with domain R. 


Prove Theorem 7.2. 


Correspondence Theorem: Let ¢: R — R’ be a ring epimorphism. There 
is a one-to-one and onto inclusion preserving map between the ideals of R’ and 
the ideals of R containing ker ¢. 


Hint: send each ideal J in R containing ker ¢ to ¢(J) and each ideal I’ in R’ 
to ¢@ '(I'), then verify that 6(@71(’)) = I’, @1(¢(1)) = I, I C J implies 
#(I) C o(J) and I’ C J’ implies ¢-1(I') C @}(J’). 


Second Isomorphism Theorem: Let J be an ideal in a ring R and Sa 
subring of R. Then 


a. S+IAR 
b SNIAR 
c (S+D/l=S/(SNI). 


Third Isomorphism Theorem: If J and J are both ideals of a ring R with 
lod CRM (RID Gl) SA 
Let J and J both be ideals in a ring R. Define [J to be the smallest ideal 


containing all products of the form ab where a € I and b € J. 


a. Show that JJ consists of all possible sums a,b; + agbg +--+ + dnbn where 
each a; € I, each b; € J and n is any positive integer. 
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b. Give an example illustrating that it is not always the case that IJ is simply 
elements of the form ab where a € J and be J 
Hint: Look at the ring Z[z] with J = (2,2) and J = (3,2). 


c. Show lJ CIN J. 
d. JJ AJ in general. 


e. f+J=1. 
f. If K is another ideal of R, show that (JK) = (J)K and I(J + K) = 
be pik. 


7.6 QUOTIENT FIELD OF AN INTEGRAL DOMAIN 


Our goal in this section is to generalize the construction of the rational numbers 
from the integers and pinpoint the exact connection between the two structures as it 
relates to ring theory. First off, one can think of the integers as the generic example 
of an integral domain and the rationals as the quotients of integers which form a field 
— we will call this the quotient field. 

Let’s begin the general construction: Let R be an integral domain and define the 
following relation on the set R x R*: 


(a,b) ~ (c,d) iff ad = be. 


The reader should recognize this familiar relation, for it is the relation which 
identifies two fractions as being equal. This relation is, in fact, an equivalence relation 
which we prove now. 


1. Reflexive: (a,b) ~ (a,b) since ab = ba and R is commutative. 


2. Symmetric: If (a,b) ~ (c,d), then ad = bc and so cb = da (since R is 
commutative) which implies (c,d) ~ (a, 6). 


3. Transitive: If (a,b) ~ (c,d) and (c,d) ~ (e, f), then ad = be and cf = de. 
Notice adf = bcf = bde and so using commutativity and cancellation in R we 


have af = be which implies (a,b) ~ (e, f). 


We will define a fraction to be the class of an element in R x R*, ie. $ = [(a, )]. 
Then the elements of the quotient field of R, which we will denote as Q(R), will be 
the set of all fractions just defined. Now it should be of no surprise to the reader how 
we intend to define addition and multiplication in Q(R). 


a c_ad+be q ac ac 
Rea ahd ae ba. ba 


We have to be careful now for notice that we have defined two binary operations on 


equivalence classes and since an equivalence class has many difference representations 
we want to be sure that the operations are not dependent on the representations (i.e. 
the operations are well-defined). In other words, 
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Claim 7.1 The addition and multiplication defined above for Q(R) are well-defined. 


Proof 7.8 If § = w and § = c, then ab’ = ba’ and cd’ = dc’. Now to show 
addition is well-defined we need to verify that ad be = wd te or equivalently that 
(ad + bc)b'd' = bd(a'd’ + Uc’) or adb'd’ + bcb'd' = bda'd' + bdb'c which follows, since 
ay = ba’ and cd’ = dc’. To show multiplication is well-defined we need to verify that 


bi = ee or equivalently that acb'd’ = bda'c which follows also from the fact that 


ab! = ba’ and cd’ = dc’. 


To show Q(R) with these two operations forms a field is identical to showing that 
the rational numbers Q with the usual addition and multiplication is a field and so 
we withhold the proof. We point out though that ° is the additive identity and + is 
the multiplicative identity in Q(R) (again, to no one surprise). 

Now that we have constructed the quotient field of an integral domain we now 
wish that it has certain properties, namely that there is an isomorphic copy of R in 
Q(R) and Q(R) is the smallest field with this property. To say that Q(R) contains 
an isomorphic copy of R means formally that there is a monomorphism from FR into 
Q(R) and we say R embeds in Q(R). We are familiar with this property since the 
integers are contained isomorphically in the rational numbers, since any integer n can 

n 


be viewed as the fraction - 


Claim 7.2 R embeds in Q(R). 


Proof 7.9 Define the following map 7: R + Q(R) by x(a) = ¢. First of all, m is a 
ring with unity homomorphism, since 


m(a+ b) ate alte o > — n(a) 4+ 1(b), 


mab) = % = #6 = 28 = g(a)r(b) and 


Second, 7 is one-to-one, since 1(a) = 0 implies $ = ° and soal=1-0 ora=0. 
Hence, kerm = {0}. 


Now we show that Q(A) is the smallest field into which R can embed. The next 
claim states formally what we mean by Q(R) being smallest. Basically, it says that if 
R embeds in a field K, then, in fact, the entire quotient ring Q(R) can embed in K. 


Claim 7.3 Suppose R is an integral domain and there exists a field K and a ring 
with unity monomorphism ¢: R + K, then there exists a unique ring with unity 
monomorphism ¢* : Q(R) > K such that ¢* (¢) = (a) for alla € R, i.e. fom = 


where m is defined as in Claim 7.2. 


Proof 7.10 To show the existence of a map ¢* we define ¢* : Q(R) > K iG 
o(a)o(b)~*. First note that we have the desired property that ¢*(¢) = o(a 


<8 
So 

* 

“—— 

| ole 
al 
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¢(a) and so ¢* does indeed extend d. Now ¢* is a well-defined (this needs to be checked, 
since the domain of this map is equivalence classes) ring with unity monomorphism, 
SINCE 


Well-defined and One-to-one: 


$a Wf ad=be iff o(ad) = o(bce) iff O(a) 6(d) = 6(b)e(©) 


iff o(a)6(b)"! = 64d)" if 6 (2) = 6" (9) 

Ring with Unity Homomorphism: 

(a.) (4+ §) =o (*4) = o(ad + be)d(bd)-! = [4(a)4(d) 
+6(0)4(c)]o(b)-14(d)~ = 4(a)4(0)? + d(c)d(d)“* = 6" (8) + 6" (8) 
(b.) &* (24) = 0" (84) = o(ac)o(bd)-! = [6(a)o()]o(0)146(a) 

= 4(a)4(b)“'6(e) od)! = 6* (4) 6* (8) 

(c) #Q)=deayts1-it=1 


To show the uniqueness of ¢*, suppose w were another such monomorphism ex- 
tending ¢, then for any element of Q(R) 


o(@)-0@-)-#FG) )-@e(@) )-9@9@ 


= (ayo)! = 0° (4). 


In summary, the quotient field of an integral domain is the smallest field which 
contains an isomorphic image of the integral domain. One last observation we make is 
that a ring with 1 4 0 embedding in a field is a characterization of being an integral 
domain. 


Theorem 7.3 Let R be a ring with 1 4 0. Then R is an integral domain iff R 
embeds in a field. 


Proof 7.11 One direction of the proof follows from our work of embedding R in its 
quotient field. For the reverse direction, assume there is a field K and a ring with 
unity monomorphism ¢: R— K. To show that R is an integral domain it is enough 
to verify that R is commutative with no zero divisors. 

To show that R is commutative, take any a,b € R. In the field K we know that 
o(a)o(b) = o(b)¢(a) and so o(ab) = ¢(ba). Now since ¢ is one-to-one we get ab = ba. 

To show that R has no zero divisors, suppose that there were an a,b € R with 
ab = Or. Then 

$(a)4(b) = (ab) = 6(0r) = OK. 
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Now K being a field we know that it has no zero divisors and so either ¢(a) = 0x 


or o(b) = Ox. But then either a € ker@d or b € ker@ which is trivial, since @ is 
one-to-one. Therefore, either a =0 or b=0. 


EXERCISES 


1 For each of the following integral domain R, compute Q(R): 
a. R=Ziij. 
b. R=2Z[V2]. 


2 Show that if R is a field, then Q(R) = R. 


7.7 CHARACTERISTIC OF A RING 


Although the notion of ring characteristic is especially relevant for fields we introduce 
it now in more generality. 


Definition 7.14 The characteristic of a ring R, written char(R), is defined as 
follows: 


1. If there exists a positive integer n such that for allr € R we have nr = 
r+r+---+r=0, then char(R) is the smallest such positive integer. 
——[{—5$S—_“__’ 


n 


2. If no such integer exists such that for allr € R we have nr =r+r+---+r= 
—__ 


n 


0, then char(R) = 0. 


Example 7.14 One can easily verify the characteristic for each of the following 
rings: 


1; .chat(Z,) =a 
2. char(Z) = char(Q) = char(R) = char(C) = 0. 
oh char (Ze x Zs) = 30. 


The rest of the section deals with the notion of a prime subfield in a field. We 
will start with a broader context and narrow our way down to this notion. Let R 
be a ring with 1. We start by considering the following map ¢: Z > R by ¢(n) = 
nl=1+1+4.---+1. It is readily seen, using a proof by cases, that this map is a ring 


homomorphism. We show the case when n,m > 0: 
Proof 7.12 


d(ntm)=(n+m)l=14+14+---+1=1414---+141414---41 
ee 


n+m m n 
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=ml+nl=¢(m)+¢(n) and 
g(nm) = (nm) =1+14+---+1=(1¢14---+I(1414---41) 
ouwoc———qw— a 


= (m1)(n1) = 6(m)¢(n). 
Lemma 7.6 Let R be a ring with 1. 


1. If char(R) = n > 0, then R contains an isomorphic copy of Z,, namely the 
cyclic subgroup of (R,+) generated by 1. 


2. If char(R) = 0, then R contains an isomorphic copy of Z, namely the cyclic 
subgroup of (R,+) generated by 1. 


Proof 7.13 When char(R) = n > 0 then the map @ defined above has kernel nZ 
and image (1) and so by FTH, 


Zn 2 ZnB % (1) CR. 
When char(R) = 0, the map ¢ is a monomorphism and by FTH, 
Z=Z/{0} = (1) CR. 
Corollary 7.2 Let F be a field. 
1. If char(F’) = p prime, then F contains an isomorphic copy of Zp. 
2. If char(F’) = 0, then R contains an isomorphic copy of Q. 


Proof 7.14 The case when char(F) = p prime follows immediately from 
Lemma 7.6.1. For the case when char(F’) = 0, by Lemma 7.6.2, F contains an 
isomorphic copy of Z via the monomorphism @: Z— F by o(n) = nl. Now, by our 
work on quotient fields, we can eatend ¢ to a monomorphism ¢* :Q—> F. Thus, Q 
embeds in F. 


Definition 7.15 The subfield of F which is either Zp or Q in Corollary 7.2 above 
is called the prime subfield of F’. 


Remark 7.9 One can show that the prime subfield of a field F is the smallest subfield 
in F, i.e. any other subfield of F must contain the prime subfield. 


EXERCISES 


1 If Ris a ring with 1, then 


a. char(R) =n > 0 iff n is the smallest positive integer such that nl = 
1+1+---+1=0, ie. the order of 1 in (R,+) equals n. 
— 


n 


b. char(R) =0 iff the order of 1 in (2, +) is infinite. 
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2 If Ris a finite ring with 1, then char(R) divides | RI. 


3 If Ris an ID, then the non-zero elements of R are either all have finite order or 
all have infinite order in (R,+). 

4 If Ris an ID, then either char(R) =0 or a prime number 
note: be sure to rule out char(R) = 1. 

5 Complete the proof by cases that map @ : Z > R by ¢(n) = nl = 
1+1+---+1 is a ring homomorphism. 
—— 


n 


6 Prove that the prime subfield of a field F' is the smallest subfield in F’,, i.e. any 
other subfield of F' must contain the prime subfield. 


7.8. THE RING OF POLYNOMIALS 


Our focus in this section is the ring of polynomials in an indeterminant, R[x], first 
introduced in Section 7.1. The following properties are readily verified for a ring of 
polynomials which we leave as exercises: 


1. If Ris a ring, then R{2] is a ring. 
2. If R is commutative a ring, then R[x] is a commutative ring. 
3. If Ris a ring with 1, then R[z] is a ring with 1. 


Later on in the text, we will show that other properties of R are carried over to 
R[x]. In this section, we will show among other things that ID is carried over from R 


to Riz]. 


Definition 7.16 Consider the ring of polynomials R\x| in the indeterminant x and 
let p(x) = ana” +--+ + a1 4+ ao € Riz] with ay # 0. 


1. The degree of p(x), written deg(p) = n. Note that the degree of the zero 
polynomial will be assigned the value —oo and we will extend addition to include 
—oo as follows: 


(—00) + (—co) = —00 (—0o) + n = —c0 = n+ (—oco). 
2. The leading coefficient of p(x), written L(p) = an. 
3. A polynomial is called monic if L(p) = 1. 


Lemma 7.7 Let R be a commutative ring with 1 with p(x) and q(x) two polynomials 
in Ria]. 


1. deg(pq) < deg(p) + deg(q) 
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2. If L(p) is not a zero divisor, then L(pq) = L(p)L(q) and deg(pq) = deg(p) + 
deg(q). 


Proof 7.15 First note that if p(x) or q(x) is the zero polynomial, then p(x)q(x) is as 
well with L(pq) = 0 = L(p)L(q) and deg(pq) = —oo = deg(p) + deg(q). Assume now 
that p(x) and q(x) are not the zero polynomial. Set p(x) = anu” +--+ a,x+ a9 and 
Q(x) = bmx™ +++- + bir + bo. Then p(x)q(x) = (Gnbm)a™*” + +++ + (abo + agbi)a + 
(aobo). It’s clear then that the degree of p(x)q(x) can be at most m+n and when 
L(p) is not a zero divisor, then L(pq) = Gnbm = L(p)L(q) and deg(pq) =n+m= 
deg(p) + deg(q). 
Theorem 7.4 Let R be an ID with p(x) and q(x) two polynomials in R[x]. 

1. deg(pq) = deg(p) + deg(q). 

2. Ria] is an ID. 

3. U( Aig) =U (8). 
Proof 7.16 Since R has no zero-divisors, the first part follows immediately from 
Lemma 7.7.2. 

For the second statement, suppose that p(x) and q(x) are not the zero polynomial. 
Thus, deg(p),deg(q) > 0 and so deg(pq) = deg(p) + deg(q) > 0. Hence, neither is 
p(x)q(x) the zero polynomial and R[x] has no zero divisors making it an ID. 

For the third statement, it suffices to show that every unit in Rix] must be a 


constant polynomial. Suppose that p(x) € U(R[x]) and q(x) € R[x] is such that 
p(x)q(x) = 1. Then 


0 = deg(1) = deg(pq) = deg(p) + deg(q), 


and so it is necessarily the case that deg(p) = deg(q) = 0 making p(x) a constant 
polynomial. 


Example 7.15 These examples discuss the units in certain polynomial rings. 
1, U(Z[z]) = U(Z) = {-1, 1}. 
2. For any field F the units in F'\x| consist of the non zero constant polynomials. 


3. An element r € R a ring is nilpotent if there is a positive integer n such that 
r” = 0. For example, 1,3,5,7 are units in Zg while 0,2,4,6 are nilpotent, since 
O'=0, 22> =0, 47 =0 and 6? =0. 


4. If R is a commutative ring with 1 and p(x) = anz" +--+ + a14 + ao € Riz], 
then p(x) ts a unit in R[x] iff ap is a unit in R and ay,...,an are nilpotent in 
R. For instance in Zg|z], an example of a unit in Zg[x| is 2x° + 6x? + 4x + 3. 


Definition 7.17 Let F be a field. A non-constant polynomial in F |] is called irre- 
ducible if it cannot be factored into two polynomials of lesser degree. Otherwise the 
polynomial is said to be reducible (or factorable). 


190 m Fundamentals of Abstract Algebra 


Example 7.16 In these examples we discuss reducibility in certain polynomial rings. 


1. Consider the polynomial ring Z2|x| and polynomials of degree two. The reducible 
polynomials are 


g=ae-n, 2 t+e=2(e+1), 2? 4+1=(¢4+1)(e+1). 

The only irreducible quadratic is x?+ax+1, for suppose x?+2+1 = (x+a)(x+). 
Then x? +x2+1=274+(a+b)x+ ab which impliesa+b=1 and ab=1, but 
this system of equations has no solution in Zo. 


2. Every polynomial of degree one in F'|x| is irreducible, for suppose p(x) € Fa] of 
degree one and p(x) = r(x)s(x) with deg(r), deg(s) < deg(p). But then it must 
be that deg(r) = deg(s) = 0, however 1 = deg(p) = deg(rs) = deg(r)+deg(s) = 
0, a contradiction. 


The next result will be useful in the upcoming chapter on integral domains. It 
show that polynomials over a field have a division algorithm just the way integers do. 


Theorem 7.5 If F is a field and f(x), g(x) are two polynomials in F |x] with g(x) 
non-zero, then there exist unique polynomials q(x), r(x) in Fa] such that 


f(x) = g(x)q(z) + r(x) where deg(r) < deg(g). 


Proof 7.17 This proof divides logically into two parts. 

Existence: Let’s first dispense with the trivial case of f = 0 in which case set 
q =r = 0. Now we prove the rest by induction on m = deg(f). If m = 0, then 
f(x) =a a constant polynomial. If deg(g) > 0, then take q(x) = 0 and r(x) = f(x) 
to satisfy the theorem, otherwise g(x) = 6 a non-zero constant and since F is a field, 
we can assign q(x) = b-'a and r(x) = 0 to satisfy the theorem. Now assume m > 0. 
Again, should it be the case that deg(f) < deg(g), then q(x) = 0 and r(x) = f(z) 
satisfies the theorem. For the case that deg(f) > deg(g), first seta = L(f), b= L(g) 
and d = deg(g). Notice that f\(x) = f(x) — ab-!aw™~4g(zx) has degree less than m, 
since we have effectively eliminated the leading term of f(x). By induction, there exist 
polynomials qi(x),ri(a) such that fi(~) = g(x)qi(x) + ri(x) with deg(r1) < deg(g). 
But then 


f(x) = file) + ab-*a'™ g(x) = g(x) q(x) + ria) + aba “g(x) 


= g(x) [qi(x) + ab-!2™-4] + ry (2). 


Therefore, q(x) = q(x) + ab-!a™~4 and r(x) =11(x) satisfy the theorem. 
Uniqueness: Suppose we also have that f(x) = g(x)G(x) + f(x) where deg(?) < 
deg(g). Equating yields |q(x)—G(x)|g(x) = r(x)—f(x). Therefore, deg(q—G)+deg(g) = 
deg(r — f) < deg(g). The only way this is possible is for deg(q — G@) = —ov, te. 
q(x) — G(x) is the zero polynomial and so q(x) = G(x). But then r(x) — f(x) equals 
the zero polynomial as well making r(x) = F(x). 
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We point out that the above result can be generalized to Riz] where R is a 
commutative ring with 1 and the divisor g(x) € R{z] has the property that L(g) € 
U(R). We leave it as an exercise for the reader to verify this fact (following the same 
proof as above). 


Definition 7.18 Let a © E D F where E and F are fields and f(x) a non-zero 
polynomial in F [x]. If f(a) =0, then a is called a root or zero of f(x). 


Corollary 7.3 Let F' be a field. 
1. The remainder when dividing f(x) € Fla] by x—ae€ Fla] is f(a). 


2. a € F is a zero of a non-zero polynomial f(x) € Flax] iff f(x) = (x—a)q(a) for 
some polynomial q(x) € Fla]. 


3. If ay,...,a% € F are zeros of f(x) € Fla], then f(x) = (a@—ay)--- (a —ag)q(z) 
for some polynomial q(x) € F[z]. 
4. The number of distinct zeros of a polynomial f(x) € Fx] does not exceed the 


degree of f(x). 


Proof 7.18 For the first statement, divide f(x) by x — a and conclude r(x) = f(a). 
The second statement follows immediately from the first statement. The third state- 
ment is by induction using the second statement. The fourth statement follows from 
the third statement. 


Remark 7.10 Here, we make some remarks regarding Corollary 7.3. 


1. We can illustrate Corollary 7.3.2 with the polynomial f(x) = x? -—x—2 € R{z]. 
One can check —1 and 2 are roots of f(x) and f(x) = (a —2)(x +1). 


2. We can illustrate Corollary 7.3.1 with the polynomial f(x) = 2? +1 € Riz]. 
If we divide f(x) by x —1, one can check that the division algorithm yields 
f(z) = (@-—1)(x4 +1) 4+ 2, so the remainder is 2 = f(1). 


3. We can illustrate Corollary 7.3.4 with the polynomial f(x) = x? -2x+1 € R{z] 
which has only one distinct root not exceeding its degree of 2. 


4. The final part of the Corollary relies heavily on the fact that F is a field. For 
instance, f(x) = x2+ 2 € Ze has four distinct zeros in Zg, namely 0,2,3,5. In 
fact, for the quaternions Q one can show the polynomial f(x) = x7 + 1 € Q[z] 
has an infinite number of zeros. 


5. We point out that the last part of the corollary can be proved in Rix] where R 
is an integral domain. We outline the proof and leave the details as an exercise. 
(a) For any positive integer n, the linear polynomial x—a is a factor of x"—a". 
(b) The linear polynomial x — a is a factor of f(x) — f(a). 
(c) Conclude that f(x) = (a — a)q(x) + f(a). 
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EXERCISES 

1 Prove that if R is a ring, then R[z] is a ring. 

2 Prove that if R is commutative a ring, then R[z] is a commutative ring. 

3 Prove that if R is a ring with 1, then R[x] is a ring with 1. 

4 Prove that if R is a commutative ring with 1 and p(x) = a,z"+---+a,4%+a9 € 
Rix], then p(x) is a unit in Rs] iff ap is a unit in R and ay,..., a, are nilpotent 
in R. 

5 R[x] where R is a commutative ring with 1. Prove that if f(x), g(x) are two 


7.9 


polynomials in F[z] with g(x) having the property that L(g) € U(R), then 
there exist unique polynomials q(x), r(x) in F'[2] such that 


f(x) = g(x)q(x) +r(x) where deg(r) < deg(g). 


Fill in the details of the proof of Corollary 7.3. 


Use the following outline to generalize Corollary 7.3 to R[x] where R is an 
integral domain: 


a. For any positive integer n, the linear polynomial x — a is a factor of «” — a”. 
b. The linear polynomial x — a is a factor of f(x) — f(a). 
c. Conclude that f(x) = (x — a)q(x) + f(a). 


Let @: Fa] > F[az] be a ring automorphism such that ¢(a) = a for alla € F 
a field. 


a. For a given f(x) € F|a], Prove degf = deg ¢(f) 
(hint: You will need to show that deg ¢(x) = 1) 


b. For a given f(x) € F [a], Prove that f is irreducible iff @(f) is irreducible. 
(hint: Use contrapositive) 


SPECIAL IDEALS 


We remind the reader that much of the study of rings in this text is based on gen- 


eralizations of properties of the integers. Continuing in this vein, we consider certain 


properties of ideals in the integers and generalize them to arbitrary commutative 


rings. 


Definition 7.19 Let R be a commutative ring and I AR with TF R. 


1. I is called prime ideal if for all r,s € R, whenever rs € I either r € I or 


sel. 
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2. I is called a maximal ideal if it is not properly contained in another proper 


ideal of R, i.e. If JAR andI CJ CR, then either J=I or J=R. 


Example 7.17 Here, we present several examples in order to illustrate the defini- 
tions just given. 


1. If 1 = (p) = pZ for p prime in Z, then I is both maximal and prime We first 
show I is a prime ideal. If mn € I, then mn = pk and so p divides mn. Since 
p is prime either p divides m or n and so either m € (p) =I orn € (p) =T. 
Second we show I is maximal. Suppose JAZ and I C J CZ. Now J = (n) 
for some integer n. Since p= p-1 € (p) C (n), this implies p = mn for some 
integer m. Since p is prime, eithern = +p and so J = (p) =I orn=+1 and 
so J = (1) = R. Later we will show these are the only prime (and maximal) 
ideals in Z and hence in Z the notions of prime and maximal ideal coincide. 


2. Let R = Zi] = {m+ni : m,n € Z} andi = V—1, called the Gaussian 
integers, and set 1 ={m+nie R : 3])m & 3|n}. One can easily show that 
IR, but we wish to show that I is, in fact, a maximal ideal in R. 


Proof 7.19 Suppose there was an ideal JAR such that IC J C R. Then there 
exists anm-+ni € J but not in I, so that either 3 Jm or3 Jn. In other words, 
m= 3k+r andn=3l+ 5 where0<1,s <3 but not both r and s equal 0. But 
then, since 3k + 3li EI C J we have (m+ ni) — (8k + 3li) =r+sie JAR. 
Again, since J is an ideal in R, we have r? + s* = (r+ si)(r — si) € J. Now 
because of the constraint on r and s we know that r? + s? is either 1, 2, 4, 5 or 
8. In either of these five cases we can always subtract an appropriate multiple 
of 3 (which is in I and so in J) to get a difference of 1. Therefore, 1 € J and 
soJ=R. 


The next result connects these special ideals with their corresponding factor rings. 


Theorem 7.6 Let R be a commutative ring with 1 and I< R. 
1. I is prime iff R/I is an ID. 
2. I is maximal iff R/T is a field. 


Proof 7.20 First assume that I is prime and we show R/I has no zero divisors. If 
(r+I)(s+JI) =I, thenrs+I=TI and sors € I. Since I is prime, either r € I 
or s EI and so eitherr +I =I ors+I=TI. Now assume that R/I is an integral 
domain. Ifrs EI, thenrs+I=I and so (r+I1)(s+JI) =I. Since R/I has no zero 
divisors, eitherr +I=TI ors+I=TI which implies eitherr € I ors € I. 

For the second statement we first assume that I is maximal and show every non- 
zero element in R/I is a unit. Ifr +I 41, thenr ¢ I. Define the following set: 


J={sr+a: se€Randael}. 
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First note that J < R, since if sr+a,s’r+b € J, then (sr +a) —(s'r +6) = 
(s—s')r+(a—b) € J and fort € R we have t(sr + a) = (ts)r + (ta) € J. Second, 
certainly J contains I since we may set s = 0 to obtain all the elements of I in J. 
Finally, I #4 J, sincer =1-r+0€ J while r ¢ I. Thus, since I is maximal, it 
must be that J = R and so in particular 1 € J. Then we can express 1 = sr +a for 
s€Randaeél andsol € sr+TI; this implies 14+I = sr+I=(s+D(r4+/J) 
and sor+TI has an inverse (note that we cannot simply take r~' + TI as the inverse, 
since r may not be a unit in R). Now assume that R/T is a field and suppose we have 
JAR with I CICR. First note that J/I a R/I since ifr+I,s+I1€ J/I, then 
(r+1)—(st+D) = (r—s)4I € J/I andift+I € R/T, then (t+ D (r+) = (tr)t+i € J/I. 
Since R/I is a field, and we’ve seen that a field has no proper non-trivial ideals, it 
follows that either J/I = {I} or J/I = R/I and so either J=I or J=R. 


Example 7.18 We can now give an alternate proof to the one presented in Exam- 
ple 7.17.2. 


Proof 7.21 We will show that R/I is a field and so by Theorem 7.6.2, it follows 
that I is maximal. Since R is a commutative ring with 1 it is enough to show that 
evey non-zero element in R/I is a unit. Therefore, take an (m+ni) +I € R/I with 
m+ni € I (and so either 3 does not divide m or 3 does not divide n). We wish to find 
an (x+yi)+JI € R/I such that [((m+ni)+ I] [(a+yi) +1] =14+T1. This is equivalent 
to saying that (m+ ni)(z + yt) —1 ET, te. (ma —ny—1)4+ (nt+my)i el. In 
other words, 3 must divide ma — ny —1 and must divide nx + my. This is equivalent 
to saying that the following linear system of equations has a solution in Z3: 


maz—-ny = 1 
ne+my = 


Now the coefficient matrix for this system is 
m —n 
-[" ) 
Therefore, the system has a solution in Z3 iff A is invertible in Mo(Z3) iff |Al 
is invertible in Z3. Notice that |A| =m? +n? 40 in Z3, since 3 does not divide both 


m and n. Therefore |A| is indeed invertible in Z3 and so backtracking through the 
argument we see that (m+ ni) +I is indeed a unit in R/T. 


Corollary 7.4 Let R be a commutative ring with 1. 
1. Every maximal ideal is prime. 


2. Ris a field iff R has no proper non-trivial ideals. 


Proof 7.22 For the first statement, I maximal implies R/I a field implies R/I an 
ID implies I prime. 

For the second statement, we have already shown one direction, so assume that 
R has no proper non-trivial ideals. Therefore, the trivial ideal {0} is maximal and so 


R= R/{0} is a field. 
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Remark 7.11 The converse of Corollary 7.4.1 is false in general. For instance, con- 
sider the ring with 1, R= Z x Z and the ideal I = Z x {0}. One can easily check 
that I is prime, however it is not maximal, since for instance J = Z x 2Z is an ideal 
properly between I and R. 

We shall show below, however, that maximal and prime are always equivalent 
notions in the ring of integers. We remind the reader that every ideal in the ring of 
integers is principal. 


Corollary 7.5 In the ring of integers, any ideal I = (n) is maximal iff n is prime. 


Proof 7.23 I maximal implies Z, = Z/(n) is a field and so n must be prime. On 
the other hand, n prime implies Z/(n) = Z,, is a field, and so (n) must be maximal. 


Proposition 7.1 In the ring of integers the following are equivalent for any ideal 
f= tnyliinZe 

1. I ts prime. 

2. I is maximal. 


3. n ts prime. 


Proof 7.24 What is left to show after all the work thus far is to show that I prime 
implies I maximal. Assume that I is prime and suppose there is an ideal J = (m) 
with IC J C Z. Without loss of generality we may assume that m,n > 0. Since 
nel CJ this implies n = mk. Thus mk € I and so ettherme I ork el. Ifmel 
we have m = nr and son = nrk and so rk = 1 which implies k = 1. Therefore, 
n=mandI= J. Ifk eI we have k = ns and son =mns and so ms = 1 which 
implies m =1. Therefore, J = R. 


EXERCISES 


1 Let R= F(R) andJ={f eR: f(1) =O}. Prove that J is a maximal ideal 
in R. 


2 Prove that for the ring with 1, R = Z x Z, the ideal J = Z x {0} is prime. 


3 Let R be a ring with unity and J, J,M < R with M maximal. Prove that if 
IN JCM, then either JC Mor JCM. 


(hint: Consider + M,J+M< R) 


4 Let P bea prime ideal of a commutative ring R. Show that for any ideals J and 
J with IN J C P we have that either J C P or J C P. 


5 Prove that a commutative ring in which every ideal is principal must have unity. 


6 Let R be aring with unity and r € R such that r 4 0,1 and r? = r. Show that 
R=(r)+(1—r) and (r)N (1 —r) = {0}. 


CHAPTER 8 


Integral Domain Theory 


N THIS CHAPTER, we further explore properties of the integers and by doing so 
| we define some special cases of integral domains. In Section 8.1, we introduce two 
of these special integral domains called Euclidean domain (ED) and principal ideal 
domain (PID). In Section 8.2, we introduce the third special integral domain called a 
unique factorization domain (UFD). In Section 8.3, we look at one particular integral 
domain which fails to be any of these three special integral domains. One area of 
study already discussed was to determine which property of a ring R carries over 
to the corresponding polynomial ring R[x]. In Section 8.4, we prove that the UFD 
property indeed carries over to the polynomial ring. 


8.1 EUCLIDEAN AND PRINCIPAL IDEAL DOMAINS 


In this section, we explore two properties of the ring of integers and look at other 
rings which share these properties. 


Definition 8.1 An integral domain R is called a Euclidean domain (ED) if there 
exists a function 6: R* + N having the property that for all a,b € R with b # 0, 
there exists q,r © R such that a = bq+r where either d(r) < 6(b) orr =0. 


Example 8.1 Here, we list some examples of integral domains. 


1. We have seen that the natural numbers has this property described in the def- 
inition above, but alas it is not a ring. However, the ring of integers forms a 
ED with 6(n) = |n|. This is a slightly different formalization of the Division 
Algorithm and is easy to verify as was done in Theorem 1.2. 


2. We have seen that F' |x| where F is a field is an ED with 6(f(x)) = deg(f(x)). 
This was verified in Theorem 7.5. 


3. The Gaussian integers, Zi], forms an ED with 6(m + ni) =m? +n”. We will 
verify this in detail at the end of the section. 


Definition 8.2 An integral domain R is called a principal ideal domain (PID) 
if every ideal in R is a principal ideal. In other words, for every I< R there exists 
ana € R such that I = (a). 
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Example 8.2 We have already seen that the ring of integers is a PID, since any 
ideal in Z has the form nZ = (n). But this argument will become moot, since we now 
show that every ED is a PID and hence the three examples given above must all be 
PIDs. We remark that it is no easy matter to come up with a PID which is not an 
ED. One can show that Z[z| where z is the complex number (1 + /—19)/2 is an 
example of a PID which is not an ED (we omit the proof). 


Theorem 8.1 Every ED is also a PID. 


Proof 8.1 Let R be an ED. If I is the trivial ideal, then I = (0) is principal, so we 
may assume that I is a non-trivial ideal in R. Since 6 maps into the natural numbers, 
there must be an element b € I of minimal 6-value. We show that I = (b) and thus 
complete the proof. Certainly, (b) C I, since b € I. For the reverse inclusion, take 
anyae€ Tl. Since R is an ED, there exist q,r € R such that a = bqg+r where either 
d(r) < 6(b) or r = 0. Notice that r= a— bq € I and since b has minimal 6-value in 
I it must be that r = 0 and so a= bq € (0). 


Example 8.3 We need to point out that not every integral domain is principal. One 
good example is Z|x]. It’s enough to show there is an ideal in Z|x] which is not 
principal. Select a prime number p and set I = (p,x) ={p- f(x) +2-9(z) : fige 
Z|x]}. Note that I< Z[x] since it is the ideal generated by p and x (see Exercise 6 in 
Section 7.5). 

We need to point out that I is a proper ideal in Z[x], for suppose that I = Z|x], 
then in particular 1 € I and so1=pf(x)+2g(x) for some f,g € Zax]. Set f(x) = 
apx* +---+a,x+ag and equate constant coefficients in the equation 1 = pf (x)+29(x) 
to get 1 = pag and so p= +1 which contradicts that p is a prime. 

Now we show I is not principal. Suppose it were, i.e. I = (h(x)) for some h € Z[z]. 
Now since p= p-1+a2-0, this implies p € I and so p=h(x)r(x) for some r € Z[z}. 
Notice then that 0 = deg(p) = deg(hr) = deg(h) + deg(r) and so deg(h) = 0 and 
deg(r) = 0. Set h(x) =m and r(x) =n and so we have the integer equation p = mn. 
Now p is prime so eitherm = +1 orm=+p. Ifm= +1, then I = (1) = Z|z] which 
we have shown is not possible. Hence, m = +p and I = (p) an ideal generated by the 
prime p. Nowx =p-0+2a-1andsox El. Hence, x =p- s(x) for some s € Zz]. 
Notice that 1 = deg(x) = deg(ps) = deg(p) + deg(s) = deg(s). Hence, s(x) = kx +l 
a linear polynomial. Therefore, x = p(kx + 1) = (pk)x + (pl). Equating x coefficients 
yields the equation 1 = pk again contradicting that p is a prime. 


For the remainder of this section we generalize some notions found in Z to rings, 
namely divides, greatest common divisor and prime. 


Definition 8.3 Let R be a commutative ring with 1 andr,s € R withr £0. We say 
r divides s and write r|s, if there exists ana € R such that s = ra. 


Remark 8.1 We make several remarks which we leave to the reader to verify. 


1. We note that r|s is equivalent to s € (r) which is equivalent to (s) C (r). 
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2. In a field F every non-zero element divides every element in the field, since if 
a,bé F witha #0, then b= a(a~'b) and so alb. 


Example 8.4 Here, we list several examples in different rings illustrating the notion 
of dividing. 


1. InZ, the integer —6 divides 30, since 30 = (—6)(—5) 


2. In F[a], for some field F, the polynomial x? +1 divides x —1, since xt -1= 
(x? + 1)(x? — 1). 


3. InR=Z|V—5| we have that 2+./—5 divides 9, since 9 = (2+./—5)(2—W—5). 


Here are some additional properties of divides which we leave to the reader to 
verify. The proofs are nearly identical to the ones presented for Z. For R be a com- 
mutative ring with 1, 


1. For all r € R we have r|r. 
2. If for r,s,t € R we have r|s and s|t, then r|t. 


3. If for r,s,t € R we have r|s and r|t, then r|(sx + ty) for any x,y € R. 


Lemma 8.1 Let r,s € R be a commutative ring with 1. 
1. If s=ru for some u € U(R), then (s) = (r). 
2. When R is an ID, then s = ru for some u€ U(R) iff (s) = (r). 


Proof 8.2 For the first statement, given s = ru, be definition r|s and so (s) C (r) 
(see Remark 8.1.1). Since u € U(R) we know u-! exists so that we can rewrite 
equation s =ru as r = su! so that s|r which implies (r) C (s) as well. 

For the second statement, we already have one direction from the first statement, 
so assume that (s) = (r). Since r € (r) = (s) we know r = sa for some a € R, and 
since s € (s) = (r) we also know s = rb for some b € R. Hence, r = sa = (rb)a and, 
by cancellation in an ID, 1 = ba which implies b € U(R) with s = ba. 


Definition 8.4 Let R be a commutative ring with 1. We say r,s € R are associates 
if both r|s and s|r. 


Example 8.5 We illustrate the notion of associate in several different settings. 


1. InZ any number and its opposite are associates, such as 3 and —3. These are 
the only kind of associates in Z (see Lemma 8.1.1). 


2. In F |x] any polynomial and a constant multiple of itself are associates, such as 
24 +1 and —6x — 3. 


&. In a field all non-zero elements are associates, since every non-zero element 
divides any element in the field. 
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Remark 8.2 Here are some additional remarks about associates some of which we 
leave to the reader to verify. 


1. r and s are associates imples that r = su for some u € U(R). 
2. r and s are associates iff (r) = (s). 
3. The notion of associates defines an equivalence relation on R. 


4. When R is an ID, then Lemma 8.1 says r and s are associates iff they differ by 
a unit. In the eramples above both rings were IDs and notice that the associates 


did indeed differ by a unit. 


Definition 8.5 Let R be an ID and r,s € R*. An element d © R is a greatest 
common divisor (gcd) of r and s, if 


rae ae) 
2. d\r and d|s (common divisor) 


3. Ife € R with elr and els, then eld (greatest) 


Remark 8.3 We make several remarks which we leave to the reader to verify. 


1. The gcd of two ring elements is never unique. In fact, from the definition of 
gcd one can quickly show that any two elements of a ring are gcd’s of the same 
pair of ring elements iff they are associates. 


2. If F is a field, then the gcd of any pair of elements is any non-zero element in 
PF. 


Example 8.6 Here, we illustrate the notion of gcd in several settings. 


1. Take the case of the ring Z. Both 6 and —6 are gcd’s of 30 and 12. Notice that 
they are associates differing by the unit —1. We typically say the gcd(30, 12) = 
6, the positive one. Thus, if we require the gcd to be positive, then in this ring 
it becomes unique. 


2. Take the case of the ring F(z] for some field F. Both x7 +1 and —2x? — 2 are 
ged’s of x*—1 and x? +2x?+2+2. Notice that they are associates differing by 
the unit —2. We typically say the gcd(30,12) = 27+, the monic one. Thus, if 
we require the gcd to be monic, then in this ring it becomes unique. 


3. It is not true that every ID has a well-defined gcd. Take the case of Z[,/—5]. 
The elements 9 and 6+3./—5 do not have a gcd. we leave the details for a later 
section, but basically +3 and +(2 + /—5) are the only common divisors of 9 
and 6 + 3\/—5 and yet 3 does not divide 2+ ./—5 nor does 2+ W/—5 divide 3. 


Theorem 8.2 In a PID the gcd exists. 
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Proof 8.3 Let r,s € R* where R is a PID. Consider the ideal (r) +(s). Since R is a 
PID, there exists d € R such that (d) = (r)+(s). We show that d is a gcd of r and s. 
First of alld £ 0, for otherwise (r)+(s) = {0}, but then r = r-1+0 € (r)+(s) = {0} 
which implies r = 0, a contradiction. Second, since r € (r) + (s) = (d) this implies 
that d\r. Similarly d|s. Finally, suppose that e|r and els. Then r = ea and s = eb 
for some a,b € R. Now, d € (d) = (r) + (s) and sod=rx+sy for some x,y € R. 
Therefore, d = eax + eby = e(ax + by) and so eld. 


Corollary 8.1 Let R be a PID andr,s € R*. 
1. If d is a gcd of r and s, then there x,y € R such that d= ra + sy. 


2. 1 is a gcd of r and s iff there exist x,y € R such that 1 =ra-+ sy. 


Proof 8.4 As in the proof above, (d) = (r)+(s) with d a gcd of r and s (note, since 
gcd’s are associates, they generate the same principal ideal). Since d € (d) = (r) +(s) 
the first statement follows. 

For the second statement, we already have one direction, so suppose there x,y € R 
such that 1 = rx+sy. Ifd is a gcd of r and s, then d|r and d|s and so d\(rx+sy), i.e. 
d\1 and there exists t € R such that 1 = dt. In other words, d is a unit, an associate 
of 1, and so 1 is a gcd of r and s. 


Definition 8.6 Let R be a commutative ring with 1 and p © R with p #0 and p not 
a unit. 


1. p is prime ¢f for all r,s € R, whenever p|rs either p|r or pls. 


2. p is irreducible if whenever r|p for some r € R, then either r is a unit or r 
is an associate of p. 


Remark 8.4 We make several remarks which we leave to the reader to verify. 


1. Let R be a commutative ring with 1 and p € R with p = ab. Then p is irreducible 
if exactly one of a and 6 is a unit and the other is an associate of p. 


2. If R is an ID, then p is irreducible iff the only way to factor p is as a unit times 
an associate. 


3. If p is prime, by induction one can easily show that p|(rirg++:Tn) implies p|r; 
for some i € {1,2,...,n}. 


4. There are no primes nor irreducibles in a field. 


Example 8.7 In these examples we illustrate prime and irreducible, and we leave 
several remarks as exercises. 


1. When R= Z, then prime and irreducible are equivalent — for instance 3 is both 
prime and irreducible 
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2. When R = F|a], then prime and irreducible are equivalent — for instance all 
linear polynomials are both prime and irreducible. When F = R, the polynomial 
x? +1 is both prime and irreducible. 


3. When R = Z[\/—5], the notion of prime and irreducible are not equivalent — 
for instance, 3 is irreducible, but not prime. The details of this fact are given 
in a separate section, but basically since 3-3 = (2+ V—5)(2 — /—5) = 9, we 
have that 3 divides (2+ /—5)(2 — /—5) yet 3 does not divide (2+ /—5) nor 
(2 — /—5). 


4. When R= Z{a]| linear polynomials are not all irreducible — for instance, 2x + 2 
is not irreducible, since 2x-+2 = 2(x+1) where 2 is neither a unit nor associate 
of 2x + 2. Neither is 2x +2 prime, since 2x + 2 divides 2(2 +1), but 2x + 2 
does not divide 2 norx+1 in R. 


The next result gives some insight into the general relationship between prime 
and irreducible. 


Lemma 8.2 In an ID, prime implies irreducible and in a PID, prime and irreducible 
are equivalent. 


Proof 8.5 Assume that p € R an ID and p is a prime, and suppose that r|p. Then 
there exists s such that p= rs. Now certainly p divides rs so that either p|r or p|s. 
Since both r and s divide p, then either r or s is an associate of p (and the other is 
a unit). 

Since R a PID is an ID, we already have one direction, so assume p € R is 
irreducible. We show that p is prime. If for some r,s € R we have p|rs, then either 
pls (and we’re done) or p{ s. In the latter case we now show that p|r. Since R a 
PID we know the gcd, say d, of p and s exists. So then d|p and d\s which implies 
(p) € (d) and s € (d). Since p{ s we know s ¢ (p) and so (p) is properly contained in 
(d). But p € (d) and so p= da where a is not a unit (for otherwise (p) = (d)). But 
then d must be a unit, and every unit is an associate of 1, which means 1 is a gcd 
of p and s. Therefore, by Corollary 8.1, there exist x,y € R such that 1 = px + sy 
and sor = pra+rsy. Since p divides prx and p divides rs it follows that p divides 
prze+rsy=r. 


Here now is the promised proof that the Gaussian integers are an ED. 
Proposition 8.1 The integral domain Z[i| is an ED. 


Proof 8.6 Define the map 6 : Z[i] + N by d(m+ni) = m?+n?. We need to first show 
that 6 is multiplicative, i.e. for allm+ni,x+yi € Zi] we have 6(m+ni)d(x+ yi) = 
d[(m + ni)(x + yi)]. We verify this directly: 


6[(m + ni)(x + yt)] = 6[(ma — ny) + (my + nz)i] = (ma — ny)? + (my + nz) 


= ma? — Imnazy + n?y? + my? + Wmnay + n22? = m2? + n?y? Zu? +n? x? 


= (m? + n”)(x? + y*) = 6(m + ni)d(x + yi). 
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Take any 21,22 € Zit] with z2 £ 0. Our goal is to divide z1 by z2 and obtain a 
quotient q and remainder r such that z1 = z2qg+1r with d(r) < d(z2). To do this, 
first note that z1/z2 € Q\t] (multiply top and bottom by the conjugate of z2). Set 
2/22 =r+ si for some r,s € Q. Imagine the elements of Zli] as a grid of points 
in the complex plane with r + si falling within some one-by-one square of that grid. 
Viewed this way it is clear that there is an m+ni € Z with |m—r| < 1/2 and 
ln —s| < 1/2. Stéu=r—m andv=s—n. Then 


2 = 2o(rt+ sti) = 2o[(m+u) + (n+ v)i] = zo(m4 nt) + zo(ut vi). 
We show that q= m-+ni andr = zo(u+vt) are the q and the r we seek. Certainly 
q € Zi] and sor = z% — z2q € Zit] by closure in the ring Zi]. There is the possibility 
that 2/22 € Zi] and so r =0. Otherwise, 
O(r) = 6[z2(ut vi)] = 6(z2)d(ut+ vi) = 5(z2)(u? + v) 


Zhi € is i) = 55(22) < 6(22). 


EXERCISES 


1 Verify the three statements in Remark 8.1. 
2 Prove the following: Let R be a commutative ring with 1. 
a. For all r € R we have r|r. 


b. If for r,s,t € R we have r|s and s|t, then rt. 


c. If for r,s,t € R we have r|s and r|t, then r|(sx + ty) for any z,y € R. 


3 Let S < Ra commutative ring with a,b € S*. Suppose d is a ged of a and b in 
Sand d=ra+sb for some r,s € S. Prove that d is a gcd of a and b in R. 


4 Prove that the gcd of two integers in the ring Z is the same in Z|i]. 


5 Let f(x) € R[x], where R is a commutative ring. Prove that for a € R, if x—a 
is a common divisor of f(x) and f’(x), then (x — a)? divides f(z). 


(hint: all derivative rules from Calculus 1 apply here) 
6 Verify the first two statements in Remark 8.2. 
7 Verify the two statements in Remark 8.3. 
8 Verify the four statements in Remark 8.4. 
9 Verify the statements given in first two examples of Example 8.7. 


10 Let F be a field and R be the subset of F'[a] consisting of polynomials with no 
x-term. 


11 


12 


13 


14 


15 


16 


17 


8.2 
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a. Carefully verify that R is an ID by showing it’s a subdomain of F'|z]. 
b. Verify that J = {a7 f(x) + 2°9(z) | f,g€ R} AR. 
c. Prove that R is not a PID by showing that J in part b is not principal. 


Prove that in a PID every non-trivial proper ideal is contained in a maximal 


ideal. 
Prove that every non-trivial prime ideal in a PID is maximal. 


Let R be a commutative ring with 1. Prove the following are equivalent for pa 
non-zero, non-unit in R: 


a. pis a prime. 

b. (p) is prime. 

c. R/(p) is an ID. 

Let R be an ID. Prove that if Riz] is a PID, then R is a field. 


Let a,b € Ran ED. 


a. Prove if a and b are associates, then d(a) = 6(b). 
b. Prove that 6(a) < 6(ab) iff bis not a unit in R. 
c. Prove 6(1) < 6(a) for all non-zero a € R. 

d. Prove a non-zero a € Risa unit iff 6(a) = d(1). 


e. Prove that for any non-zero, non-unit a € R that 6(a”) < 6(a"*!) for any 
non-negative integer n. 


Let R be an ED and suppose a,b € R with bla but a { b. Prove that 5(b) < d(a). 


Let R be an ID and p € R non-zero, non-unit. p is irreducible iff (p) is maximal 
among principal ideals, i.e. if (p) C (r) C R, then either (r) = (p) or (r) = R 
(you'll need to consider two cases: when r is a unit and when r is not). 


UNIQUE FACTORIZATION DOMAINS 


In this section, we explore one final property of the ring of integers and define a 
special family of integral domains which is a larger family of IDs than the two already 
investigated: ED and PID. 


Definition 8.7 Let R be an ID andr € R* —U(R). 


1. 


We say r has factorization if there exist irreducibles p1,po,...,Pn € R such 
that r = pjp2---Pn forn> 1. 
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2. We say R has factorization if every non-zero, non-unit in R has factorization. 


3. We say r has unique factorization jf it has factorization and whenever r = 
D1p2°°* Pn and r = qig2°+:Gm for some irreducibles pi,...-,;Pn,1,---;4m € R, 
it must be that n = m and there exists a permutation 0 € Sy such that p; and 
do(i) are associates for alli =1,2,...,n. 


4. Ris a unique factorization domain (UFD), if every non-zero, non-unit in 
R has unique factorization. 


Example 8.8 Our immediate goal is to show that PID implies UFD and so ED 
implies PID implies UFD. Thus, all PID examples we have already seen are examples 
of UFDs, such as Z, F |x], and Zi). 


1. For instance, in Z, two factorizations of —90 are (5)(—2)(3)(3) and 
(—2)(—3)(3)(—5), but o = (1 4 2) € Sy matches up the first set of irreducibles 
with their associates in the second. 


2. Z|/—5] is an ID which is not a UFD. For instance, 9 can be factored as (3)(3) 
and (2+./—5)(2—/5), but 3 is not an associate of either 2+./—5 or 2—/—5. 


We will cover the details of this in the next section. 


3. Consider the polynomials in Fla] for a field F which have no x-term. One 
can show that R is an ID (show its a subdomain of F|x]) and that x? and x3 
are irreducibles in R which are certainly not associates (since U(F|x]) = F*). 
Notice then that x®° = (x?)(x?)(x?) = (x°)(x3) are two different factorizations 
of x® into irreducibles. 


Definition 8.8 A ring has the ascending chain condition (or ACC ) on principal 
ideals if there are no infinite chains of the form 


(Eo Aa) Belts) Goes 
Example 8.9 We will illustrate the failure of ACC in a couple of examples. 


1. Consider the ring R= Z+2Q|z], polynomials with rationals coefficients except 
for the constant coefficient which must be an integer. Then R fails to have ACC 
on principal ideals, since 


(t)G 


s(¥)9() (3) 


2. For any polynomial ring R[x] with R a ring with 1, ACC fails for ideals (not 
assuming principal). Take the ideal I, to be polynomials of degree < n, then 


hel Gipees, 


Lemma 8.3 Let R be an ID. If R has ACC on principal ideals, then R has factor- 
ization. 
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Proof 8.7 We prove the contrapositive statement. If R does not have factorization, 
there exists an, € R* —U(R) which does not have factorization. Certainly r, cannot 
be an irreducible so that r1 = ab for some non-units a,b € R. Since r, does not have 
factorization, at least one of a and b does not as well. Let’s say its b and set rg = b. 


Claim 8.1 (71) G (r2). 

First, since rg divides r; we have (ri) C (r2). Second, (r1) 4 (r2) for otherwise 
we’ve seen that r; = urg for some u € U(R). Since ry = arg by equating and 
a 


cancellation we would have a= u € U(R), a contradiction. 


The method above can be repeated on rz to produce an r3 with (r1) & (r2) & (13). 
And this method can be repeated to construct an infinite chain of principal ideals with 
each properly contained in the next. Hence, R does not have ACC on principal ideals. 


Theorem 8.3 /f R is a PID, then R has factorization. 


Proof 8.8 We show that R has ACC on principal ideals and then appeal to 
Lemma 8.3. Suppose to the contrary that R did not have ACC on principal ideals. 
Then there would exist an infinite chain of the form 


(ri) & (r2) | (73) Ge. 


Set I = UP (Tn) which one can show is an ideal in R (Exercise 2). Since R is 


a PID, there exists anr € R such that I = (r). Since r € I this implies r € (rz) for 
some k and so (r) C (rz). But then 


(r) & (Tx) & (Teta) S++ | (7), 


which is an obvious contradiction. 


Lemma 8.4 If R is an ID which has factorization, then R has unique factorization 
iff every irreducible in R is a prime. 


Proof 8.9 First assume that R have unique factorization. Suppose p € R is irre- 
ducible and p|rs for some r,s € R. So then rs = pt for some t € R. We may assume 
that neither r nor s are units, for ifr or s is a unit, the result easily follows. For 
instance, if r is a unit, then s = ptr~+ and p|s. Now t is not a unit, for otherwise 
p=r(st—') which would contradict that p is irreducible (since neither r nor st~* are 
units). By assumption we can express each of r, s andt as a product of irreducibles, 
4.€. 
r= Pip2°** Pk; S = Pk+1Pk+2°°* Pns t= %192°**dm- 


Then pip2-++Pn = pdig2°**Im, two factorizations of a ring element into irre- 
ducibles. Since R has unique factorization, in particular, p must be an associate of 
some p; and so p\p;. If1<i<k, then p\lr andifk+1<i<n, then pls. 

Second, assume that every irreducible in R is also prime. Suppose that r € R* — 
U(R) with r = pype-++ Pn and r = q192°+:dm where the p; and q; are irreducibles. 
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We will prove this direction of the proof by induction on n+m. Whenn+m = 2, 
then r = py = q and the result follows trivially. When n+ m > 2, note that since 
D1P2°°* Pn = 192°°* Am this implies p, divides qiq2--:dm- Since p is also prime, this 
implies that pi|qx for some k and so (qx) © (pi). Since (qx) and (pi) are maximal 
among principal ideals (see Exercise 17 in Section 8.1), it must be that (qx) = (p1) 
which makes q, and p, associates. By reordering the q; in the factorization of r we 
may assume that qx = q (since unique factorization is up to a permutation). Write 
q = piu for some u € U(R). By cancellation, py-+-Pyn = (ug2):+:dm. We can 
rename Ugz as gz since we are simply looking for associate pairs. Now by induction, 
n =m and the irreducibles p2,-++ ,Py can be put into one-to-one correspondence with 
G2,--+-;%m 80 that each pair are associates. 


Corollary 8.2 Every PID is a UFD. 


Proof 8.10 By Theorem 8.8, R has factorization. By Lemma 8.2, the notion of 
prime and irreducible are equivalent in a PID and so, by Theorem 8.4, R is a UFD. 


Corollary 8.3 In a UFD the notion of prime and irreducible are equivalent. 
Proof 8.11 This is an immediate consequence of Theorem 8.4. 


One final goal concerning UFDs is to show that the property of UFD is carried 
over from a ring R to the ring of polynomials R[x]. For instance, it will follow that 
Za] is a UFD, since Z is a UFD. This will be achieved in Section 8.4. 

Figure 8.1 is a picture of the world of integral domains. 


EXERCISES 


1 Verify that in Example 8.9.1 that FR is a ring. 


2 Prove that if Ris a ring and 4, C Ig C Jz C--- is an infinite chain of ideals in 
R, then the union 


| tee. 
i=l 


3 Show that in a UFD the gcd of two non-zero elements exists. (hint: if r,s € R* 


factor each into irreducibles, r = p{' ps? --- per and s = pl pe -»+ pin. For each i 


set m; = min {e;, f;} and show p7"'ps”? --- pl is a gcd of r and s). 


8.3. ONE PARTICULAR INTEGRAL DOMAIN 


We now look carefully at Z[,/—5]. It gives us a good example of when things go 
wrong for an ID, for this ID is not a UFD and the notion of prime and irreducible 
are not equivalent in this ID. The proofs use a notion of a norm, which has not been 
formally introduced, but easy to understand. First off, it is an integral domain. To 
see this we point out that Z[,/—5] is a subset of the integral domain C and so it is 
enough to show it is a subdomain of C. There are three things to check: 
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ID 
Z[/—5], R= { a9 + aor? +---+a,2" : ag€ Fone NnF1 } 
UFD 
Z{x] 
PID 
Z{(1 + Y—19) /2] 
ED 
Z, F(x], Z{i] 
FIELD 
Zp 
Q 
R 
Cc 


Figure 8.1. A snapshot of the landscape of integral domains. 


1. 1 ER, since 1 = 1+ 0-5. 
2. (a+ b/—5) — (c+ dV—5) = (a—c)+ (b—d)/—-5 ER. 
3. (a+ bV—5)(c + dV—5) = (ac — 5bd) + (ad + be)\/—5 € R. 


We now introduce an important function which is very useful in analyzing subrings 
of C. 


Definition 8.9 The norm of a complex number, written 6(z) = 2z for z €C. 


In other words, the norm of a complex number is the square of its magnitude. 


For instance, for a+ b/—5 € Z[V—5], 
6(a + bV—5) = (a + bV—5)(a — bV—5) = a? + 507. 


The norm function is multiplicative, i.e. 6(z1z2) = 6(21)6(z2), since 


6(2122) = 21292122 = 242224 29 = 21212222 = 6(21)0(22). 


Let’s put the norm to work right away. 
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Claim 8.2 The only units in Z|,/—5| are +1. 


Proof 8.12 Suppose a + b\/—5 € U(Z[\/—5]). Then there exists ac+d/—5 € 
Z{[/—5| such that (a + b\/—5)(c + dV/—5) = 1. Taking the norm of both sides yields 
(a? + 5b*)(c? + 5d?) = 1, but then as a positive integer a? + 5b* = 1 and the only way 
this is possible for integers a and b is a= +1 andb=0 and soa+b/—5 = +1. 


Claim 8.3 The gcd cannot be defined on Z|,/—5). 


Proof 8.13 To see this we give an example of two numbers in Z[\/—5| which have 
no gcd, namely 9 and 6 + 3\/—5. We shall see this in several steps. 

First of all, we need to show that the only proper divisors of 6 + 3\/—5 are +3 
and +(2+ \/—5). For suppose 6+ 3.\/—5 = (a+ b\/—5)(c + dV—5). Taking the norm 
of both sides yields 81 = (a? + 5b?)(c? + 5d?) and so a? + 5b? = 1,3,9,27 or 81. By 
symmetry of the factors it is enough to consider the values 1,8 and 9. The value 1 has 
only solution a = +1 and b=0 and soa+bV/—5 = +1. The value 3 has no solution 
for integers a and b. The value 9 has two possible solutions: a= +3, b =0 and so 
a+b/—5 = +3 ora=+2,b=+1 and soatb/—5 = +(2+ V—5) or +(2— V5). 


But in the latter case we would have 
64+ 3V—5 = (2— V—5)(c + dV—5) = (2c + 5d) + (2d — c)V—5. 


This gives the following linear system of equations: 


2c+5d=6 
2d—-—c=3 - 


But this linear system does not have an integer solution and so 2 — /—5 cannot 
be a factor of 6+ 3/—5 (similarly, neither can —(2 — /—5) be a factor). 


Second, notice the following factorizations: 
9=3-3= (2+ V—-5)(2— V—-5) 64+ 3V—5 = 3(24+ V—5). 


Thus we see that +3 and +(2 + /—5) are common divisors of 9 and 6 + 3/—5 
and by what we have just shown above they are the only common divisors and thus 
are the only candidates for the greatest common divisor. 

Third, we now show that 3 does not divide 2 + /—5 nor does 2 + /—5 divide 3. 
We first suppose that 3 divides 2+ \/—5. It follows that 2+ /—5 = 3(a+ bV—5) 
and taking the norm of both sides yields 9 = 9(a? + 5b?) and so a? + 5b? = 1. 
Therefore, a+ b/—5 = +1 and so 2+ ./—5 = +3, a contradiction. Similarly, if 
3 = (2+./—5)(a+b/—5), then taking the norm yields once more that 9 = 9(a?+5b?). 

Now we see the gcd(3,6 + 3/—5) cannot exist, since if we choose one of the 
candidates to be the greatest, then the other candidate should divide it, but it doesn’t. 


Claim 8.4 In Z[/—5] the notions of prime and irreducible are not equivalent. 
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Proof 8.14 We shall show that the number 8 is irreducible, but not prime. First, 3 
is irreducible, since if 3 = (a + b/—5)(c + d\/—5), then taking norms of both sides 
yields 9 = (a? + 5b?)(c? + 5d?) and so a* + 5b? = 1,3 or 9. It’s enough to consider 
the values 1 or 3. In the case of value 1 as we have seen a+ b\/—5 = +1 a unit. In 
the case of value 3 we’ve seen there is no integer solution. Second, 3 is not prime, 
for notice 3 divides 9 = (2 + \/—5)(2 — /—5) yet we've seen that 3 does not divide 
2+ /—5 and with a similar argument one can show 3 does not divide 2 — \/—5. 


Claim 8.5 Z|W—5] is not a UFD 


Proof 8.15 To see this we shall produce a number in Z|,/—5| which has more than 
one factorization and these factorization are truly distinct. Consider 


9=3-3 9 = (2+ V—5)(2 — V—5). 


We’ve seen already that 3 is irreducible. We show that 2 + \/—5 is irreducible 
(the argument for 2 — \/—5 being irreducible is similar). Suppose 2+ /—5 = (a+ 
b/—5)(c+ d\/—5). Taking norm of both sides yields 9 = (a? + 5b?)(c? + 5d?) and so 
as in Claim 8.4, the only possibilities for a+ b\/—5 are £1, a unit. Finally, 3 is not 
an associate of 2+ ./—5, since the only units are +1 and they certainly do not differ 
by 41. 


EXERCISES 


1 Consider the set R = Z|,/—p] for some prime number p € Z. 


a. Prove that R is an integral domain, by showing it is a subdomain of C (what 
does a typical element in R look like?). 


b. Define a norm N on R as follows: N(a+ ib,/p) = a? + b*p. Prove that N is 
multiplicative, i.e. N(2122) = N(21)N (9). 


c. Prove for z € R that z is a unit iff N(z) = 1. 


d. Show that every non-unit in R has a factorization into irreducibles. 
2 Explore the Gaussian integers R = Z/i). 


a. Define a norm on R and show it’s multiplicative. 


b. Define a norm N on R as follows: N(a-+ ib) = a? + b?. Prove that N is 
multiplicative. 


c. What are the primes (also irreducibles) in R? 


8.4 POLYNOMIALS OVER A UFD 


Our goal in this section is two-fold. First, we wish to show that polynomials whose 
coefficients come from a UFD is again a UFD, hence another property preserved when 
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we go from a ring R to Ria]. We point out that this does not hold for ED nor PID. 
Simply consider the ring Z which is both an ED and PID, yet Z[z] is neither an ED 
nor PID. 

Second, we explore when a polynomial over a UFD is irreducible. Both goals stem 
from a result attributed to Gauss which we will first prove. We will often refer to the 
quotient field of a UFD, so if R is a UFD, then Q will represent the quotient field of 
R. 


Definition 8.10 Let R be a UFD and f(x) € Riz]. 
1. The content of f, written C(f) is the gcd of all the coefficients of f(x). 
2. f is primitive if its content is 1. 


Example 8.10 Let f(x) = 30x? — 12% + 60 € Z[a]. The content of f in this case is 
6 (or —6). Notice that f(x) =6f(x) where f(x) = 5x? +2ar +10. It’s easy to see that 
for any f(x) € R[x] we can express f(x) = C(f)f(x) where f(x) € Riz] is primitive 
and this representation is unique up to a unit, since content is unique up to a unit 
(since this is true of gcd). This fact remains true in Q|x] as we prove now. 


Lemma 8.5 /f R is a UFD and Q = Q(R) with f(x) € Q[a], then f(x) = cf (x) 
where c € Q and f(x) € R[x] is primitive. Furthermore, this representation is unique 
up to a unit. 


Proof 8.16 Write f(z) = fra" +---+ Hat F € Q[a]. Set b = body +++ bn. Then 
certainly bf(x) € R[x]. Set g(x) = bf(zx). ‘As in the example we can express g(x) = 
C(g)g(x) where g(x) € R[x] is primitive. Then f(x) = M9) 5 (x) and we have the 
required representation. 

Now suppose f(x) = cf(x) and f(x) = dh(x) where c,d € Q and f(x),h(x) € 
Rix] primitive. Write c = ¢ and d = © and equate to get 2 F(x) = th(x). Then 
as f(x) = brh(x). Set p(x) = asf(x) = brh(x). Since f(x) and h(x) are primitive, we 
have C(p) = as and C(p) = br. Since content is unique up to a unit, as = ubr for 
some unit u € U(R). Then c = ud and so udf(«) = dh(«x) which implies uf(«) = 
agen 


Example 8.11 We will illustrate the proof with an example. Let f(x) = By? + Bert 
2 € Q[z]. Then 30f(x) = 3502? + 1682 + 315. Now C(3502z? + 168x + 315) = 7 so 
that 30f(x) = 7(50x? + 27% + 45) and hence f(x) = £(50x? + 27x + 45), where 
f(a) = 50x? + 272 + 45 € Z[x] is primitive. 


Definition 8.11 For f(x) € Q|z], the content of f, written again as C(f) will be 
any associate of the c € Q of Lemma 8.5. 


Example 8.12 In Example 8.11, C(f) = us. 


Note that this definition of content agree with the former definition in the case 
that f(x) € R[x]. We now prove the fundamental result of this section. 
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Lemma 8.6 (Gauss) In a UFD R, the product of two primitives in R\x] is again 
primitive. 


Proof 8.17 Let f(x), g(x) € R[x] be primitive and set h(x) = f(x)g(x). Express 
f(x) = anx" +--+ +a,2 +490, g(£) = bmv™ + +++ +b)0 + bo and h(x) = Cmynz™t” + 
-+»+c% +09. Now suppose to the contrary that h(x) were not primitive. Then C(h) 
is a non-unit and so in R a UFD we may factor C(h) into irreducibles which are 
also primes. Hence, there exists a prime p € R dividing C(h). Let’s fix this prime 
and consider the factor ring R/(p). Fora € R, we will use the shorthand notation @ 
to signify a+ (p) € R/(p). Since p is prime, this implies the ideal (p) is prime and 
so R/(p) is an ID. Now consider the polynomial ring (R/(p))|x] which is thus also 
an ID. Let’s establish the notation f(x), g(x), h(x) € (R/(p))[x], where, for instance, 
f(x) =Gi2z"+-+-G@xt+a@. Just as h(x) = f(x)g(x), so does h(x) = f(x)g(x), since 


G = se arb) = S- aby = > Grr. 


i=k4+l i=k+l i=k+l 


Notice that since p divides C(h) this implies € = 0 for all i and so h(x) is the 
zero polynomial in (R/(p))[x], however neither f(x) nor g(x) are the zero polynomial. 
Indeed, since f(x) is primitive, its content is 1 and so no prime can divide C(f). In 
particular p does not divide C(f) and so there exists an a; such that p does not divide 
a;. Thus, @ #4 0 and therefore, f(x) has non-zero coefficients (the same argument 
works for g(x)). Therefore, f(x) and G(x) are zero divisors in (R/(p))[x], which 
contradicts that (R/(p))[x] is an ID. 


Corollary 8.4 Let R be a UFD and Q = Q(R) with f(x), g(x) € Q[z]. Then 
C(fg) = C(f)C(g). 


Proof 8.18 By Lemma 8.5, we can write f(x) = cf(x) and g(x) = dg(x) for some 
c,d € Q and f(x), g(x) € R[x] primitive. So then f(x)g(x) = cdf(x)g(x) and 
f(x)G(x) is primitive by Gauss’ Lemma. Therefore, C( fg) = cd = C(f)C(g). 


Example 8.13 Consider the polynomial f(x) = x?—5x+6 € Z[2]. The factorization 
x? —5x+6 = (2x—$)(3ax—5) attests to the fact that f(x) is not irreducible in Q{z]. 
Now f(z) is, in fact, not irreducible in Z[x], since x? —5x +6 = (x—2)(x—3). Using 
Gauss’ Lemma, we will show that it is always the case that if a polynomial factors 
over the quotient field, then it factors over the UFD. We can illustrate the proof with 


this example: 


3 6 3) 3 3) 
—50+6= (Fa ) (32 5) = (2 — 2)-(x% — 3) = (a — 2)(x — 3). 

3) 3) 3 
Corollary 8.5 Let R be a UFD and Q = Q(R) with f(x) € Rix] of degree at least 
one. If f(x) is irreducible in Rix], then f(x) is irreducible in Q[ax]. In particular, if 
f(x) is irreducible in Z|x], then f(x) is irreducible in Q|z]. 
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Proof 8.19 We prove the contrapositive statement (as illustrated in the example 
above). Suppose that f(x) = g(x)h(a) with g(x), h(x) € Q[a] non-units. In Q[2] 
this means the degree of g(x) and h(x) are at least one. By Lemma 8.5, we can write 
g(x) = cG(x) and h(x) = dh for some c,d € Q and g(x), h(x) € R[x] primitive. Note 
that by a simple degree argument it follows that deg(g) = deg(g) and deg(h) = deg(h). 
Furthermore, since f(x) € R[x] we know on the one hand C(f) € R, and on the 


other hand C(f) = C(g)C(h) = cd. Thus, cd € R and f(x) = [cdg(x)|h(x) where 
cdg(x), h(x) € R[x] are non-units. 


Example 8.14 The converse of Corollary 8.5 is nearly true, i.e. if f(x) is irreducible 
in Q|a], then f(x) is irreducible in Rix]. The problem is that sometimes the content 
of f(x) can act as a non-unit factor. Consider f(x) = 2x —4 € Zz]. This is cer- 
tainly irreducible in Q|x], since all degree one polnomials over a field are irreducible. 
However, 2x — 4 = 2(x — 2) where neither 2 nor x — 2 are units in Z[x] and so f(x) 
is not irreducible in Z|x]. However, if we assume f(x) is primitive, then we can show 
the converse holds. 


Corollary 8.6 Let R be a UFD and Q = Q(R) with f(x) € Riz] primitive of degree 
at least one. If f(x) is irreducible in Q|x], then f(x) is irreducible in R[x]. 


Proof 8.20 We prove this again using the contrapositive statement. Suppose that 
f(x) = g(x)h(a) where g(x), h(x) € Rix] non-units. We show that g(x) (and similarly 
h(x)) has degree at least one. If deg(g) = 0, then g(x) =a € R* —U(R). Then, by 
Gauss’ Lemma, C(f) = C(gh) = C(g)C(h) = aC(h). But then C(f) is not a unit 
which contradicts that f(x) is primitive. Thus, since g(x),h(x) have degree at least 
one, this implies they are non-units in Q|x] and so the same factorization illustrates 
that f(x) is not irreducible in Q|z]. 


Remark 8.5 Jf R is a commutative ring with 1 and p © R is prime. Then p is 
prime in R|x| as well. Indeed, p prime in R implies R/(p) is an ID. Now R\|x|/(p) = 
(R/(p))[a] via the map (anx” +++» + ay + a9) + (p) > (Gn + (p))a" +++ + (a + 
(p))x+ (ao+(p)). Therefore, R[x]/(p) is an ID which in turn implies that p is prime 
in Ria]. 


Theorem 8.4 /f R is a UFD, then Ra] is a UFD. 


Proof 8.21 First of all, since R is a ID, then so is Ria]. Second, we show that 
Rix] has factorization. Let f(x) be a non-zero non-unit in R[x]. Should the de- 
gree of f(x) be zero, then f(x) € R which is a UFD and so factors into irre- 
ducibles. If the degree of f(x) is greater than zero, then f(x) is a non-unit in 
Q|[x]. Since Q[a] is an ED, it is also a UFD and so f(x) factors into irreducibles 
in Q[a], i.e. f(x) = pi(x)po(x)-++pn(x) where each p(x) € Q[a] ts irreducible. 
By Lemma 8.5, write each p(x) = cipi(x) where each G € Q and p(x) € Ria] 
primitive. By Corollary 8.6, each p;(x) is irreducible in R[x]. By Gauss’ Lemma, 
Pilz) po(x)-++Py(x) ts primitive, and so since f(x) = c1C2-++ Capi (x) po(x) +++ Pp(x), 
it follows that C(f) = cico:++cn. Now f(x) € Rix] so then cyco-++Cn = C(f) E R 
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a UFD. Thus, c1¢2+++ Cn = (192°** Gm @ product of irreducibles in R (and so in R[x} 
as well). Hence, f(x) = q192° +: GmPi(X)p2(x) +++ Pn(x) ts a factorization of f(x) into 
irreducibles in R[z]. 

Finally, we show that R{x| has unique factorization. By Lemma 8.4, it’s enough 
to show that every irreducible in Rix] is also prime. Let f(x) be irreducible in R[z]. 
Should f(x) have degree zero, then f(x) = p € R and so, since R is a UFD, p is 
prime in R. By Remark 8.5, f(x) = p is also prime in R[x]. If deg(f) > 0, then by 
Corollary 8.5, f(x) is irreducible in Q|a]. Since Q|a] is a UFD, it follows that f(x) is 
prime in Q|a]. Now suppose f(x) divides g(x)h(x) in Ria], where g(x), h(x) € Riz] 
Then f(x) certainly divides g(x)h(x) in Q[a]. Now f(x) is prime in Q[a] so that 
either f(a) divides g(x) or divides h(x) in Q|x]. Without loss of generality, assume 
f(x) divides g(x) so that g(x) = f(x)p(x) for some p(x) € Qa]. By Lemma 8.5, 
write p(x) = cp(x) where c € Q and p(x) € R{z] primitive. On the one hand, 
C(g) = C(fg) = C(f)C(p) = ¢ (note that f(x) irreducible in Rix] implies f(x) is 
primitive, for otherwise its content would be a non-unit factor). On the other hand, 
since g(x) € Ria], its content is in R. Hence, c € R which implies p(x) € Ria] and so 
f(x), in fact, divides g(x) in R[x]. Thus, we have shown that f(x) is prime in R{z}. 


To finish this section we present two results which can be useful in determining 
irreducibility called the Rational Root Theorem and Eisenstein’s Criterion. 


Theorem 8.5 (Rational Root Theorem) Let R be a UFD and Q = Q(R) with 
f(@) = ana” +++» + a," + a9 € Ria] of degree at least one (an # 0). If } € Q is a 
root of f(x) with gcd(s,t) =1, then slap and tlan. 


Proof 8.22 Since f(s/t) =0, this implies that a,(s/t)" +---+a1(s/t) +ao =0 and 
multiplying through by t” yields ans” + ans" 't +--+ + a,st™! + aot” =0. 

On the one hand ays” = —t(an_1s”- 1 +--+ + ayst”~? + agt”1) which implies t 
divides ays", but since gcd(s,t) = 1 it follows that t must divide an. 

On the other hand (ans"~+ + an_1s"~*t + +++ + ayt”~1)s = agt” which implies s 
divides aot”, but since gcd(s,t) = 1 it follows that s must divide ao. 


Example 8.15 Consider the polynomial f(x) = 4a34+2x?—x—5 € Z[z]. The factors 
of ag = 5 are +1,+5 and the factors of a, = 4 are +1,+2,+4. Hence, should f(x) 
have a root in Q it would have to be among the candidates 


1 1 1 5 5 5 
Le a Toes ee Ae 


The only candidate that works is 1. Should no candidate have worked, then we 
could conclude that f(x) is irreducible in Q|a]. Indeed, should f(x) factor it would 
have to have a linear factor (since deg(f) = 3) and hence a root in Q. 


Theorem 8.6 (Eisenstein’s Criterion) Let R be a UFD and Q = Q(R) with 
f(t) = ana” +--+ + a,x + a9 € Rix] of degree at least one. Suppose there is a 
prime p © R such that 


ses ieee 
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2. p|a; fori=0,1,--- ,n—-1. 
3. p* tao. 


Then f(x) is irreducible in Q[x]. 


Proof 8.23 Suppose, to the contrary, that f(x) were not irreducible in Q|a]. Then 
by Corollary 8.5, f(x) would not be irreducible in R[x]. Thus, f(x) = g(x)h(x) where 
g(x) = byx*® +---+byrt+bo € Ria] with by 4 0 and h(x) = cmax™ +++» +e,a+e € R[z] 
with Cm # 0. Since p|ap and ap = boco and p is prime, it follows that plbo or p|co. But 
p’ does not divide ag so we may conclude that p|bo or p|co, but not both. Without loss 
of generality, assume that p divides co but not bo. Since p does not divides dn = bcm, 
it follows that p does not divide cm. Let r be smallest such that p does not divide cy 
(note that1 <r <m<_n). Consider the coefficient a, = boc, + bycp—1 + +++ + b,€0 
and rewrite as a, — b\Cp_1 —+++— b-Co = boc. Since pla,-, p|bicp_1,..., plbpco, it must 
be that p|boc, and so p\bo or plc, neither or which is a true statement. Hence, f(x) 
must be irreducible in Q[x]. 


Example 8.16 We illustrate Eisenstein’s Criterion with several examples. 


1. Consider the polynomial f(x) = 10x? + 152 +6 € Z{x] and the prime p = 3. 
Since 3 { 10, 3|15, 3|6 and 9 { 6, by Eisenstein’s Criterion, f(x) must be 
irreducible in Q|x] (and since f(x) is primitive it is also irreducible in Z|a] ). 

2. If no prime satisfies Kisenstein’s Criterion for a given polynomial, we cannot 


conclude that the polynomial is not irreducible. Take, for instance, x7+1 € Z[z| 
which is certainly irreducible (for otherwise i € Z). 


3. We will apply the following statement which is left as an exercise: Suppose 
f(x) € R[x] and there exists anr € R such that f(a+r) is irreducible in Q{z]. 
Then f(x) is irreducible in Q[z]. 


Consider again f(x) = 27 +1 € Zz]. Notice that f(x +1) = (x +1)? +1= 
x? +2xr+2 which is irreducible in Q|2] (Eisenstein with p = 2). Therefore, f(x) 
is irreducible in Q|x] (and since f(x) is primitive it is also irreducible in Z|x]). 


EXERCISES 


1 Decide whether or not each of the following polynomials in Z|z] is irreducible 
over Q|z] and irreducible over Z[z]. 


a. 1022+ 6x? — 182 +12 
ty 2 Fae ED 


ce. 24-102? +1 


e. 32° — 622 + 282 — 14 
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f. Qa4 — 323 — 2? -—a2+1 
g. o4+22?-1 
2 Let R={m+2ni : m,ne Z}. 
a. Prove that Q(R) = Q(i). 
b. Show that x? + 1 is reducible over Q(R) but not over R 


c. Use the previous parts to conclude that R is not a UFD. 


3 Prove if anu" +--+ +a 1x + a9 € Fz] is irreducible, then so is agz” +--+ + 
An—1L + An. 


4 Suppose f(x) € R[x] and there exists an r € R such that f(# +r) is irreducible 
in Q[z]. Prove that f(x) is irreducible in Q[z]. 


CHAPTER 9 


Field Theory 


N THIS CHAPTER, we set the stage for the next chapter on Galois theory. In 

Section 9.1, we remind the reader of some definitions, concepts and results as well 
as introduce the definition of algebraic. In Section 9.2, we investigate field extensions 
which allows us to look at chains of fields, an important idea in Galois theory. This 
also allows us to investigate some famous geometric impossibilities in Section 9.3. 
In Section 9.4, we look at some particular and important field extensions, and in 
Section 9.5, we prove the existence of some of these structures we introduced. Finally, 
in Section 9.6, we completely classify finite fields. 


9.1 REVIEW AND ALGEBRAICITY 


We need to review some ideas from general ring theory, but present them from a 
different perspective. First, there is the notion of a polynomial extension of a ring. 


Definition 9.1 Let S' be any subring of a commutative ring R and take r © R. The 
ring of polynomials in r over S, written 


S[r] = {agr? +--+ + ayr + ag > a0, @1,-..,ag € S andd > 0}. 


We leave it to the reader to show that Sr] (see Figure 9.1) is indeed a ring (show 
it’s a subring of R). In fact, one can show that S{r] is the smallest ring containing 
both S and r in the sense that if T is a ring and SC T andr € T, then S[r] C T. 


Example 9.1 We have run across several examples of this structure: ZJi], Z[W2], 


Z\V—5], Ql. 


Second, we have S|z], the ring of polynomials in x over S, where z is an 
indeterminate and S is a commutative ring. 

Third, we have a map which connects these two structures above. Recall the 
evaluation homomorphism 


®, : S[x] > S[r]_ by ®, (age? +++ + a,x + ag) = agr? +--+» +air+ ap. 


NO 
—_ 
o 


DOI: 10.1201/9781003335283-9 
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Figure 9.1 A visual representation of |r]. 


The reader should note that the evaluation map is indeed a homomorphism which 
fixes S and maps onto S[r]. For brevity, we may rephrase this epimorphism using 
functional notation as follows: If we set f(x) = agx4+---+ a,x +a, then f(r) = 
agr?+++++ayr+ag and so ©,(f(x)) = f(r). 

An important concept in field theory is the notions of algebraic versus transcen- 
dent, which we now define. 


Definition 9.2 Letre RDS rings. 


1. We sayr is algebraic over S if there exists a non-zero polynomial f(x) € S|z] 
such that f(r) =0, i.e. r is a zero of some non-zero polynomial in S|a} 


2. We say r is transcendental over S if no such polynomial exists. 


Example 9.2 Here are several examples which illustrate the definitions. 
1. i€ C is algebraic over Z, since i is a zero of f(x) =27 +1 € Zfz]. 
2. /2 ER is algebraic over Z, since V2 is a zero of f(x) = x? —2 € Za]. 
3. iv/2 is algebraic over Z, since iW2 is a zero of x4 —2 € Z[z]. 
4. iW? is algebraic over Z[/2|, since iW2 is a zero of x? + V2 € Z[V2\ [2]. 
5 


. It can be shown that both 7 and e in R are transcendental over Z. 


Note that, r being algebraic over S is equivalent to there being an element agr?+ 
+» +ayr+aq € S[r] with ag # 0 and agr? +--+» +a;r+ ao = 0, ie. there is a 
non-trivial element in S{r] which equals zero. This also implies that elements in S[r] 
are not uniquely represented as elements of S/r] (just consider zero which can be 
represented as 0 and also as agr¢+--++a,r-+ao for some agr?+---+ayr+ag € Slr] 
with ag # 0 — see Theorem 9.1.3 for what is meant by unique representation in 
S{r]). This is yet another equivalent way to define r being algebraic over S. We now 
summarize these statements in a theorem and leave the details of the proof to the 
reader. 
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Theorem 9.1 Let S be any subring of a commutative ring R and take r € R. The 
following statements are equivalent: 


1. r is transcendental over S. 


2. There is no element agr? + +--+ ayr+ao € S[r| with ag 40 and agx? +--+ + 
ayx + ap = 0. 


3. If (agr? +++-+ayr+ao) = (ber? +-+-+bir+bo) in Slr], then d =e and a; = 6; 
fori=1,2,...,d. 


4. ker®, = {0}. 
5, S[z] = Slr]. 


One can interpret the last statement in the theorem as saying if r is transcendental 
over S, then in a sense (with respect to S) r is just like an indeterminate. 


Example 9.3 Here are some examples which further illustrate unique representation 
and Theorem 9.1. 


1. i € C is algebraic over Z since i? +1 is a non-trivial element of Z{i] which 
equals zero. Hence, we do not have unique representation in Zli] (for instance, 


observe how i? + 1 = —i+1). Notice also that i? +1 =0 implies that i? = —1, 


e®=—-i, it=1, etc., so that 


Z|t] ={m+ni : m,ne Z}, 


and thus no higher powers of i are needed to represent elements in Zli|. Fur- 
thermore, this description of Zli| has unique representation. 


2 
2. 2 ER is algebraic over Z since (v2) — 2 is a non-trivial element of Z| V2] 
which equals zero. In this case, 


Z[V2| ={m+nvV2 : m,neZ}, 
and this description has unique representation. 


3. One can show there is no non-trivial element of Z|x|] which equals zero. There- 
fore, there is no simpler way to describe Z|n] besides with the original definition, 


Zr] = {agr? +--+ ayn +aq : a9,01,...,ag € Z and d > 0}, 
and this description has unique representation. 


We now focus our attention on fields and address the same material. Our choice 
of variable names will change a bit. Instead of r € RD S, we will havea € ED F 
with FE and F fields. Let’s look again at the evaluation epimorphism in this context, 
namely 


®,: Fla] > Fla] by ®.(f(2)) = f(a). 
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In this context we know a bit more. First, since F’ is a field, we know that F[z] 
is a PID. Therefore, the ideal, ker®, = (p(x)), for some p(x) € Fx]. Now since ®, 
is an epimorphism, by the Fundamental Theorem of Homomorphisms, we have 


Fla]/(p(a)) = Fla]/ker®q = Fial. 


We now simply restate Theorem 9.1 in the context of fields with no further proof 
required. 


Theorem 9.2 Let F' be any subfield of a field E and take a € E. The following 
statements are equivalent: 


1. a is transcendental over F. 


2. There is no element aga? +--+» +aja+ao € Fla] with ag 4 0 and agx? +--+. + 
a,x + apg = 0. 


3. If (aga4 + +++ + a,a+ a9) = (bea® +--+ bia + bo) in Fla], then d = e and 
a; = 6; fori =1,2,...,d. 


4. ker Oi {0}: 
5. Fla] = Fla. 


EXERCISES 


1 For each r € RD S a commutative ring, verify that r is algebraic over S: 


a VY-5EC 2Z. 
b. V3ERDZV3}. 


c. VW2ERDZ. 
d. ##/2€C DZ. 


e. VW3ERDZ[V3). 


2 For each problem in Exercise 1, express elements of S[r] with unique represen- 
tation. 


3 Let S be any subring of a commutative ring R and take r € R. 
a. Prove that Sr] is a ring. 
b. Prove that S[r] is the smallest ring containing S' and r. 


c. Verify that the evaluation map 
®, : S[z] > S[r] by ©,(agx? +--+ + ar +a) = agr? +--+ ar +ao, 
is a homomorphism which fixes S and maps onto Sr]. 


4 Prove Theorem 9.1. 
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9.2 VECTOR SPACES & EXTENSION FIELDS 


The goal of this section is to make a connection between fields and vector spaces 
and develop some results based on this connection. The reader may wish to review 
some basic concepts in Linear Algebra such as vector space over a field, basis and 
dimension. 


Definition 9.3 [f F is a subfield of a field E we call E an extension field of F 
and F is called the base field. The notation we shall use is F C E. 


In this context one can view F as a vector space. The vectors will be the elements 
of E, scalars the elements of F’, vector addition will be the field addition in FE, and 
scalar multiplication will be the field multiplication in E. The dimension of the vector 
space £ over the field F' will be denoted by |F : F] and is called the degree of EF 
over F’. If [E: F| < co we say that E is a finite extension of F’. There is a very 
good reason why this notation is employed and is identical to [G : H], the index of a 
group G over a subgroup H, but this will take some time to develop. 


Example 9.4 [C: R] = 2 with basis 1,i. Indeed, 1,i span C since every element of C 
can be written as a real linear combination of 1 andi. They are linearly independent 
over R, since if for a,b € R we have a-1+6-% = 0, then a+ bi = 0 and so 
a = b = 0 (otherwise i would be an element of the reals or C would have zero 
divisors, a contradiction). 


We leave it to the reader as exercises to show that 
I (eer a a BaP 


2. If F C K C E are fields and [EF : F] < oo, then [FE : K| < [E : F] and 
[KA : FF] <[E: F). 


This next result hints at a connection between [EF : F| and [G: H]. 


Theorem 9.3 [fF CE CK are fields with [K : E),[E: F] < 00, then [K : F] < 00 
and |K: F] =|[K: E|[E: F. 


Proof 9.1 Set [kK : E] = m and [E : F| = n and suppose that E has basis 
@1,42,...,Q4n over F and K has basis 61, b2,...,bm over E. We will show that the set 
of products {ajb; : 1<icn, 1<j< mb} forms a basis for kK over F, and so the 
result will be proved. We first note that these products a,b; are distinct, becauseof the 
linear independence of by, b2,...,bm over E. 

To show they are linearly independent over F’, suppose that )7¥") S071 Cij(aidj) = 
0 for some cy € F. Then Sh (oik1 Cijai) bj = 0 where ST, cija; € E. Since the 
b;’s are linearly independent over E, we have that >>", cija; = 0, for all j. But since 
the a;’s are linearly independent over F’, we have that ¢,; = 0 for alli and for all j. 
To show the set of products span K over F’, take any element c € K. Since the b;’s 
span K over E, we can write c= je cjb;, for some c; € E. Since the a;’s span E 
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over F, for each j we can write cj = Yi, dija; for some dij € F’. Putting all this 
together we have 
n m 


c= > cyby = D0 >) digasby = D7 dig (audy), 


j=l j=l i=l i=1 j=l 


and so we have written c as a linear combination of the products with scalars from 


F. 


Notice the parallel to the group theory result [G : H] = |G: K][Kk : H] for 
H < K <G, which was proved in a very different way appealing to an equivalence 
relation. 

We need a bit of notation to continue our discussion. Let F(x) represent the field 
of quotients of the integral domain Fz], i.e. 


FeO (Fig) = {a0 : f(x), g(x) € Fla] and g(x) is not the zero polynomial} 


For a € E D F, let F(a) represent the field of quotients of the integral domain 
Flal, i.e. 


: f(a), g(a) € Fla] and g(a) of 


Lemma 9.1 Forae ED F fields, the following statements are equivalent: 
1. a is algebraic over F. 
2. Fla| = F(a). 
3. (F(a) uF] < 08. 


Proof 9.2 We first show the first statement implies the second. As we pointed out 
earlier, given that a is algebraic over F it follows that Fla] = F|z|/(p(x)) for some 
non-zero polynomial p(x) € F |x]. Since Fla] C E, a field, the ring Fla] is, in fact, 
an integral domain, and via the isomorphism above, F|x]/(p(x)) ts also an integral 
domain. Therefore, p(x) is prime (see Exercise 13 in Section 8.1). Since F |x] is an 
integral domain, p(x) is irreducible in F|x] as well. This in turn makes (p(x)) maximal 
among principle ideals (see Exercise 17 in Section 8.1). Since Fa] is a PID, (p(x)) 
is, in fact, a maximal ideal and so Fla] = F|x|/(p(x)) is a field. Now if Fla] is a field, 
then its quotient field is no larger than itself; more precisely, F(a) = Q(F[a]) = F[al. 

Now we show the second statement implies the third. We are given that F(a) = 
Fla] = F[x|/(p(x)). Now p(x) cannot be the zero polynomial, for otherwise the field 
F(a) would be isomorphic to F|x] which we know is not a field. Furthermore, p(x) 
is not of degree zero for then it could not have any zeros. Set d = deg(p) > 1. For 
brevity, let’s establish the coset notation f(x) = f(x) + (p(x)) € Fla]/(p(x)). We can 
view F'|x|/(p(x)) as a vector space over F as follows: Vector addition will be the usual 
coset addition and scalar multiplication will be defined as 


c f(x) =cf(x) which equals Cf(z). 
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We show that the vectors 1,Z,...,2¢" form a basis for F[z]/(p(a)) over F. Once 
we show this, the third statement is then true, since the dimension of F|x|/(p(x)) over 
F equals d and F\a|/(p(a)) = F(a) as vector spaces over F, so that the dimension 
of F(a) over F also equals d, i.e. [F(a) : F] = d < o. First, they are linearly 


independent since if 


agl + a,%+-+:+ag 12%! =0, for some a; € F, then 


dg + aye +++» + ag_1x4-1 = 0, which implies ag + ayx + +++ + ag_12% € (p(z)) 


Set f(x) = ay + aya +--+ ag1x4 1 and so we have f(x) = p(x)g(x) for some 
g(x) € Fla]. Notice that 


d>d—12> deg(f) = deg(p) + deg(g) = d+ deg(g), 


which implies that deg(g) <0, i.e. g is the zero polynomial. But then so is f = pg the 
zero polynomial, t.e. ag = a, = ++: = dad-1 = 0, and so linear independence is shown. 

These vectors 1,Z,...,x2¢1 also span F[x]/(p(x)). Take any f(x) € F[z]/(p(a)). 
Since Fx] is an ED we have f(x) = p(x)q(x) + r(x) for some q,r € F[z] and 
deg(r) < deg(p). Set r(x) = ay t+ayx+++++ ag 1x. Then 


f(x) = p(x) Oz) + r(x) = r(x) = aol + a% +--+ + dg xe! 


Hence, we see that f(x) is a linear combination of 1,z,...,2¢!. 

Now we show third statement implies the first by proving the contrapositive state- 
ment. If a is transcendental over F, we’ve shown that Fla] = F [x]. Since 1,x,27,... 
forms an infinite basis for F |x] we have [F|a] : F] = co and so [Fla] : F] = o. 
Now since F(a) contains an isomorphic copy of Fa], then it’s also the case that 
[F'(a) : F] = 00 as well. 


The polynomial p(x) mentioned in the proof of the Lemma is called the irre- 
ducible (or minimal) polynomial of a over F’. We need one last result before 
we look at some specific examples in detail. This result will make it easier for us to 
identify this p(a) in concrete settings. 


Lemma 9.2 Fora € E D F fields and p(x) € F\a]*, the following are statements 
equivalent: 


L. ker = (wlx)) 
2. p is irreducible with p(a) = 0 


3. Forall fe Fiz], fia)=0 if olf. 


Proof 9.3 The third statement being equivalent to the first is straight forward. First 
assume the third statement is true and notice that 


f eker®, iff fla) =0 if lf iff f € (p(2)). 
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Now assume the first statement is true and observe that 


f(a) = 0 iff f € ker®, iff f € (p(x)) af plf. 


Now we show that the first statement is equivalent to the second. First assume the 
first statement is true. The second statement follows from our earlier work, for we 
proved in Lemma 9.1 that F(a) = Fla] = F{x|/(p(x)) where p is irreducible (note that 
a is algebraic over F,, since p is not the zero polynomial, so that ker®, is non-trivial). 
Furthermore, p(a) = 0 since p € ker®,. Now assume that the second statement is 
true. Set ker®, = (f(x)) and we will show that f and p are associates. Since p(a) = 0 
this implies that p € ker®, = (f(x)) and so f\|p. Since p is irreducible, f is either 
a unit or an associate of p. However, f cannot be a unit, for this would imply that 
ker®, = (f(x)) = Fla] and this would mean that every polynomial in Fx] has the 
element a as a zero, an obvious contradiction. Hence, f is an associate of p, and so 


(p(x) = (F(#)) = ker®a. 


Observe that the third statement in the Lemma is what gives p the alternate 
name of minimal polynomial of a over F’. 

Lemma 9.2.2 gives us a practical way of finding the irreducible polynomial of a 
over F’. We simply need to find an irreducible polynomial which has a as a zero. 
This in turn allows us to compute the value of [F(a) : F], since as in the proof 
of Lemma 9.1, [|F'(a) : F'] is equal to the degree of the irreducible polynomial of a 
over F’. The reader should verify that such an irreducible polynomial is unique up to 
associates. 


Example 9.5 We compute the degree of some extension fields. 


1. Now [Q(i) : Q| = 2, since x? +1 € Q[z] is irreducible over Q with i as a zero 
(see Example 8.16.3). 


2. Now [Q(W2) : Q| = 4, since x4 +2 € Q[z] is irreducible over Q (Eisenstein 
with p = 2). 


3. We compute [(Q(\/1+ V3) : Q| = 4 as follows: Set a = \/1+ V3. Then 
a? =1+ V3 and (a? — 1)? =3 and so at — 2a? -2=0. Therefore, a is a zero 
of x — 2x — 2 € Q{z] which is irreducible over Q (Eisenstein with p = 2). 


4. We compute [Q(\/1+ V2) : Q) = 4 as follows: Set a = 1+ V2. Then 
a? = 1+ V2 and (a? — 1)? = 2 and so at — 2a — 1 = 0. Therefore, a is a 
zero of x4 — 2x? —1 € Q|z] which is irreducible over Q. To see this evaluate 
p(x +1) = a+ + 4x3 + 42? — 2 which is irreducible over Q (Eisenstein with 
p= 2). 


Remark 9.1 It’s easy to prove thata € E D F fields and a is a zero of f(x) € Fz] 
implies [F (a) : F'] < deg(f) (exercise). 
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Example 9.6 Let’s illustrate the use of Remark 9.1. First, (Q(W2) : Q(v2)] < 4, 
since V2 is a root of x* — 2 € Q[z] which has degree 4. In fact, we can do better, 


[Q(72) : Q(V2)] < 2, since W2 is a root of x2 — V2 € Q(V2)[x] which has degree 2. 


Before we look at some examples in depth, we need to define the some additional 
concepts. 


Definition 9.4 Let a1,a2,...,dn € E D F fields. We define F(a1,a2,...,an) to be 
the smallest subfield of E containing a1, a2,...,@, and F. We say that E is finitely 
generated over F if E = F(ai,a2,...,@n) for some aj, a2,...,4n € E, and the 
G1, 42,---,@n are called the generators of F over F’. 


Remark 9.2 Observe that this definition agrees with and generalizes the definition 
of F(a). Furthermore, the reader should check that for any k, 1<k <n that 
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Example 9.7 We illustrate how one can reduce the number of generators for an 
extension field. These facts will be useful in Example 9.8. 


1. Set E = Q(vV2,iW2). We show that E = Q(iv2). Indeed, since —(iv/2)? = V2, 
the field Q(iW2) contains V/2,iW/2 and Q. Therefore, since E is the smallest 
field containing V/2,iW/2 and Q, we have that E C Q(iw2). The reverse inclu- 
sion is evident, since E contains i¥2 and Q and Q(iw2) is the smallest field 
containing iv/2 and Q. 


2. Set E = Q(V2, V2). We show that E = Q(W/2). Indeed, notice that 
V2 = 26 — 9/2(9-1/3) — /2(92)-1 € E, 
so that Q(W2) C E. Likewise /2 = (¥/2)3 and W/2 = (W/2)? so that E C Q(W2). 
Example 9.8 We illustrate how one can compute the degree of a finite extension. 


1. Set E = Q(V2,ivW2). We wish to compute [E : Q|. We will compute this in 
two different ways (for sometimes only one of these ways may be available in 
certain situations). 


For the first approach, we’ve seen in Example 9.7.1 that E = Q(iw2). Hence, 
to compute [E : Q| we need only find the irreducible polynomial of ix/2 over Q. 


Now this is easy, since x4 +2 is irreducible over Q (use Eisenstein’s Criterion 
with p = 2) with iv/2 as a zero. Therefore, 


[E : Q| = [QiV2) : Q| = deg(a* + 2) =4. 
A second approach to computing |E : Q| is to make use of Theorem 9.3. Set 


E, = Q(vV2) and note that QC E, C E with E = E\(iv2). Now [E, : Q=2 
since x” —2 is irreducible over Q (Eisenstein with p = 2) and has V2 as a zero. 
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The index |E : Ey| = 2 as well. To see this, first note that ix/2 is a root of 
x? + 2 € E\[z] so that 


[E : e1] < deg(x? + V2) =2. 


Now, [E : E,] cannot equal 1, for if it did, then E = E,. But then iv/2 € 
E, = Q(v2), a contradiction (Since R is a field containing Q and V2, then R 
contains Q(/2) the smallest such field. Hence, if i/2 € Q(V2), theniW2 ER 


and soi € R, a contradiction). Therefore, 


[E:Q|=[B: E\l[Ey : Q) = (2)(2) =4. 


2. Set E = Q(vV2, W2). We wish to compute [E : Q). This example illustrates yet 
another style of approach for computing indexes. Once again we consider the 
chain of fields Q C Q(V2) C E. We just saw in Example 9.8.1 that [Q(V2) : 
Q] = 2. Thus, 


[E: Q)=[E: Qv2)[Q(v2) : Q = 2-[E: Qv2)], 


and so 2 divides [E : Q). In a similar manner, consider the chain of fields 
Q C Q(W2) C E. Observe that [Q(W/2) : Q] = 3 since x? — 2 € Q{x] is the 
irreducible polynomial of W/2 over Q (Eisenstein with p = 2). Therefore, as 
above, we get that 3 divides [E : Q). Since 2 and 3 are relatively prime, we have 
that 6 divides [E : Q| and so |E : Q| > 6. We now show the reverse inequality 
(and hence, [E : Q| = 6). Let’s set Ey = Q(V2) so that Q C Ey C E with 
E = E,(¥V/2). Then 
[E : Fi) = [Fi(V2) : Fi] <3, 


since x? — 2 € Ey [x] of degree 3 with V/2 as a zero (not necessarily irreducible 
and Eisenstein doesn’t apply here). Therefore, 


[E : Q) = [EB : Qv2)|[Q(v2) : Q = 2- [BE : Q(v2)] < (2)(3) = 6. 


There is yet another (easiest) way to compute this degree. We’ve seen in Exam- 
ple 9.7.1 that E = Q(W2). Now x®—2 € Q[a] with x°—2 irreducible (Eisenstein 
with p = 2) and having V/2 as a root. Therefore, 


[E : Q| = [Q(V2): Q =6. 


The next result besides its theoretical importance has practical implications, for 
it allows us to construct finite fields. 


Theorem 9.4 Leta € ED F fields with a algebraic over F. Let p(x) € F |x] be the 
irreducible polynomial for a over F with d = deg(p). Then 1,a,a?,...,a¢~! 
basis for F(a) over F. 


; OTms G 
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Proof 9.4 As was mentioned at the beginning of this section, we know that |F(a) : 
F| = d, so it’s enough to show that 1,a,a7,...,a¢! are linearly independent over 
F.. Suppose, to the contrary, that 1,a,a”,...,a7 are linearly dependent. Then there 
would be scalars ag,a1,...,@g-1 € F not all zero with agpl + aya+---+ag_1ae! = 0. 
Set f(z) = an +aie+---+ag 127 € F|az] so that f(a) = 0. By Lemma 9.2 we 
know |F'(a) : F] < deg(f) < d—1, a contradiction. 


Example 9.9 We use Theorem 9.4 to given explicit descriptions of F(a) when a is 
algebraic over F. 


1. Now i € C is algebraic over R with irreducible polynomial p(x) = x? + 1. 
By Theorem 1, the elements 1,i form a basis for R(t) over R. Therefore, 
R(i) = {al + bi : a,b € R}. In other words, R(i) = C, i.e C is the smallest 
field containing both i and R, which should come as no surprise to the reader. 
Note that in general, the number of generators of a field will be different than 
the number of elements in a basis. For instance, in this excample there is one 
generator i and two basis elements 1,1. 


2. We’ve seen that \/2 is algebraic over Q with [Q(V2) : Q| =3 and so by Theo- 
rem 9.4, a basis for Q(W2) over Q is 1,2'/3 22/3. Therefore, 


Q(W2) = {a + 62/3 + 27/3: a,b,c € Qh. 


3. Theorem 9.4 allows us to construct finite fields of any prime power order (which 
we will see later are the only sizes allowed for finite fields). To illustrate this 
we will construct a field of order 4 = 27. Set F = Zz and p(x) = a7 +2+1€ 
Z2|x]. Notice that p(x) is irreducible over Zy, for otherwise p(x) would have 
a linear factor, say x —c for some c € Zg and so c would be a zero of p(x). 
However p(0) = 1 4 0 and p(1) = 1 £ 0, so this cannot be true. Therefore, 
let’s formally define a new symbol a to be a zero of p(x) so that p(x) is the 
irreducible polynomial of a over Zg. By the theorem we know that 1,a forms a 
basis for Z2(a) over Zz so that 


Zo(a)={c-l+d-a: cd€Zo}={c+da : c,d€ Z.}= {0,1,a,1+ a}. 


Note that since there are two choices for c and for d, we see that Z2(a) is a 
field with 4 elements. Up to this point, we have never run across a field of such 
a size. Recall that the only finite fields we were aware of were Z, for some 
prime p. Let’s write the addition and multiplication tables for this field. Let’s 
see a few examples of adding and multiplying before we exhibit the whole table 
(which the reader should be able to reproduce). First note that since a is a zero 
of p(x) = 27 +a2+4+1, this implies that a? +a+1=0 ora*=a+t1 in Zp. For 
instance, we can add (1+a)+a=1+2a=1 and multiply (1+a)-a=a+a? = 
a+(a+1)=1. Figure 9.2 exhibits the complete addition and multiplication 
table. 


Field Theory @ 227 


Figure 9.2 The addition and multiplication table for a field with four elements. 


EXERCISES 


1 Compute each of the following: 


a. 
b. 
c. 


d. 


QvI+i):Q 

Q(v3, 73) : Q| 
Q(V2,iVv3) : Q 
Q(v2, 73) : Q 


2 Let F=Z; and f(r) = 22 +27+1€ Fiz]. 


a. 


b. 


Cc. 


Carefully explain why f(z) is irreducible in Fz]. 


Let a be a root of f(x) in some extension of F’. Give a description of F(a) 
and compute its size. 


Compute (3a? + 2a + 1) + (3a? + 3a + 2) and (a? + 2)(a? + 3a) in F(a). 


3 If F is a subfield of a field EF, prove that 


a. 


b. 


[E:F]=1 iff E=F. 


If F C K C E are fields and [EF : F] < ov, then [EF : K] < [E: F] and 
[kK : F] < [E: Fi. 


4 Suppose FC K C E are fields with [E : F] < oo. Prove that if kK is properly 
between F and EF, then [E: K] and [Kk : F] are strictly less than [E’: F). 


5 Verify that the irreducible polynomial of a € E D F is unique up to associates. 


6 Verify Remark 9.1. 


7 Verify the statement made in Remark 9.2. 
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Figure 9.3 Diagram used for constructing ab. 


8 Let a € E D F be algebraic over F with [F(a) : F] = n an odd number. 
Prove that a? is also algebraic over F with [F(a?) : F] = n an odd number. 
Furthermore, show that F(a?) = F(a). 


9.3 GEOMETRIC CONSTRUCTIONS 


Imagine you have at your disposal a straightedge with two marks on it so that the 
length between the two marks is one unit length of measure (pick your favorite units 
of measure). You also have an unmarked compass and a writing instrument (like a 
pencil) and paper. 

Recall from high school geometry that certain constructions are possible with 
these tools: 


1. two line at right angles to each other. 
2. two parallel lines. 
3. the midpoint of a line. 


Definition 9.5 A real number a will be considered constructible if using these 
tools you can construct a line segment of length |a|. A subset X of the real numbers 
is constructible if every a € X is constructible. 


Theorem 9.5 If a,b are two positive constructible real numbers, then so are a+ 6, 
a —b, ab and a/b (when b 4 0). 


Proof 9.5 To construct a+b simply mark off the length of a and of b side by side. 
To construct a — b mark off the length of a and then at the end of the line segment 
of length a mark in reverse the length of b. To construct ab first construct a right 
triangle with legs of length 1 and a, then extend (or mark off on) the leg of length 1 a 
line segment of length b. Construct a line parallel to the hypothenuse of the original 
right triangle. Extend (or mark off on) the leg of length a to find the intersection of 
the parallel line constructed and the line containing the line segment of length a. 

Then one can show using similar triangles that the length of x in Figure 9.3 equals 
ab. A similar construction shows that a/b is constructible as well, which we leave as 
an exercise. 
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a 1 


Figure 9.4 Diagram used for constructing \/a. 


Corollary 9.1 If a,b are two constructible real numbers, then so are a+b, a—b, ab 
and a/b (when b 0). 


Proof 9.6 This corollary requires a proof by cases which relies heavily on the result 
just proved. 


Corollary 9.2 The rational numbers are constructible. Furthermore, the collection 
of all constructible real numbers forms a subfield of R. 


Proof 9.7 First note that the integers are certainly constructible. The rest follows 
immediately from Corollary 9.1. 


Lemma 9.3 [fa is a constructible non-negative real number, then so is \/a. 


Proof 9.8 First construct a line segment of length a and of length 1 side by side 
on the same line. Construct the midpoint of this line and construct a semicircle with 
diameter the line segment of length a+ 1. Construct a line perpendicular to the line 
segment of length a+ 1 passing through the point where the line segment of length 
a and 1 meet. Find the point of intersection of this line and the semicircle. From 
geometry we know that this point together with the line segment of lengtha+1 forms 
a right triangle. Furthermore, this altitude of the right triangle we just constructed 
indicated in Figure 9.4 has length x = \/a (use similar triangles). 


Corollary 9.3 If F' is a field of constructible real numbers and a > 0 constructible 
with a € F yet a ¢ F, then every element of the extension field F(,/a) is con- 
structible. 


Proof 9.9 Since the field of constructible numbers contains F by assumption and /a, 
by Lemma 9.8, and F(./a) is the smallest field containing F and ./a, this implies 
that F(./a) is contained in the field of constructible numbers. 
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(x,y) 


Figure 9.5 A point is constructible if its coordinates are constructible. 


Remark 9.3 The field F(,/a) is called a quadratic extension of F since x? -—a € 
F(a] is the irreducible polynomial of \/a over F and so [F(./a) : F] = 2. 


Theorem 9.6 (Characterization of Constructible Numbers) A real number b 
is constructible iff there exists a chain of fields of the form Q=Fo CPL Ch, C 
--- C Fy, such that each F; is a quadratic extension of F;-1 fori = 1,2,...,n and 
be Fy. 


Proof 9.10 First assume the existence of a chain of fields as in the statement of 
the theorem. We prove by induction on the length of the chain that b is constructible. 
For n = 0 we have b € Q which we know is constructible. Assume now that n > 
0. We are given that b € F, = Fy-1(./an) for some an > 0 in Fy_1 yet an ¢ 
F,-1. By induction, F,-1 is constructible and by Corollary 9.3, Fn = Fn—1(\/an) ts 
constructible. 

Now assume that b is constructible. Consider the xy-plane as the arena where all 
these constructions with straightedge and compass occur. Define a point (x,y) in the 
xy-plane constructible if its coordinates x and y are constructible numbers. Indeed, 
it certainly is, since we can mark off on the x-and y-axis the values x and y and then 
drop two perpendiculars and find the point of intersection as in Figure 9.5. 

Because of the tools we have for constructing, it is clear that if P,, P2, P3; and Py 
are constructible points, then so are (when they exist) 


1. The point(s) of intersection of lines P,P: and P3P,. 


2. The point(s) of intersection of line P,P: and the circle with radius the line 
segment P3P, and center P3. 
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3. The point(s) of intersection of the circle with radius the line segment P; Py and 
center P, and the circle with radius the line segment P3P, and center P3. 


In fact, these are the only ways of constructing new points with the tools available 
and hence the only way of producing constructible numbers. Hence, to finish the proof 
it is sufficient to show that all points constructed in this manner have coordinates 
which lie in at most a quadratic extension of the coordinates of the points P,, P2, P3 
and P4. 


Claim 9.1 The standard equations of the lines and circles described above have co- 
efficients all of which are constructible numbers. 


In the case of a line passing through two constructible points P,(a,b) and P2(c, d), 
the equation of the line is 


= = (y—b)(c—a) =(x—a)(d— b) 


= (d—b)x+(ce—a)y + [b(e — a) — a(d — d)| = 0. 


Note that d—6, c—a and b(c—a) —a(d-—b) are all constructible numbers. In 
the case of a circle with radius P,P, and center P,, the equation of the circle is 


(a — a)? + (y—b)’ = (c—a)? + (d— 8)? 
2 — 2ax — Qhy + [a* + b? — (c — a)? — (d —b)?] =0. 

Note that —2a, —2b and a? +b? —(c-— a)? — (d—b)? are all constructible 
numbers. Hence, the Claim is proved. 

Let’s call such a line or circle in Claim 9.1 a constructible line or circle. As 
stated earlier, the proof will be complete if we show that the points of intersection 
of constructible lines and circles have coordinates which lie in at most a quadratic 


extension of the coefficients of the constructible lines and circles. To do this we will 
consider the three cases listed above. 


2 


=> & 


1. Consider two constructible lines ax + by = c and dx + ey = f. By Cramer’s 
rule, the point of intersection of the two lines has coordinates 


c b a Cc 
= Pe ee OF _ d f _ af —ced 
a ll eeS be’ os ae hl weSte 
de F € 


Thus, x and y are constructible numbers. 


2. Consider the case of a constructible line ax +by+c = 0 and constructible circle 
r+y?+dr+eyt+ f =0. Solving the first equation for x or y (whichever is 
possible) and substituting into the second equation yields a quadratic equation 
with constructible coefficients. Then using the quadratic equation, the points of 
intersection have coordinates which lie in at most a quadratic extension of the 
coefficients of the constructible line and circle. 
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3. For the case of two circles, notice that we can reduce this case to the second 
case as follows: Let x*+y?+ax+by+c=0 andx?+y?+dr+ey+f=0 
be two constructible circles. By subtracting the two equations one obtains a 
constructible line. Then the points of intersection on the two circles correspond 
to the intersection of this line with either of the two circles, which is the previous 
case already considered. 


Hence, the proof of the theorem is complete. 


Remark 9.4 We points out a couple of things which we will make use of in what 
follows. 


1. Under the assumptions of Theorem 9.6, since each F; = Fy_1(./aj) for i = 
1,2,...,n it follows that F, = Q(,/a1, V/aa, -.., an). 


2 Sine: |Fea i, | = 2 fort = 1, 2,...490 us follows that |F,3Q|\=2" 


Corollary 9.4 [fb is a constructible number, then |Q(b) : Q| is a power of 2. Hence, 
it is not possible to construct a real number b such that |Q(b) : Q] is infinite or not 
a power of 2. In particular, b cannot be the root of an irreducible polynomial over Q 
of degree which is not a power of 2. 


Proof 9.11 Suppose that b is constructible. By Theorem 9.6, there exists a chain of 
fields of the form Q = Fo C Fi C Fy C-:: C F, such that each F; is a quadratic 
extension of F;_, fori = 1,2,...,n and b € Fy. As in the remarks above, since 
be F,, it follows that 


2” = [Fr : Q = [Fn : Q(b)][Q() = Q). 


Hence, |Q(b) : Q| divides 2” and so must be a power of 2. 


9.3.1 Famous Impossibilities 


We can now answer some questions about geometric constructions that were posed 
and not answered for many centuries. 


Theorem 9.7 (Doubling the Cube) Given a cube with side of one unit length, it 
is not possible to construct with straightedge and compass the side of a cube having 
volume twice that of the original one. 


Proof 9.12 Since the original cube has side of length 1, then the cube with double 
the volume would have a side of length \/2. Hence, in order to construct this larger 
cube it would be necessary that \/2 be a constructible number. But \/2 is a root of the 
irreducible polynomial x* — 2 € Q|z] and so by Corollary 9.4 cannot be constructible. 


Theorem 9.8 (Squaring the Circle) Given a circle with radius of unit length, it 
is not possible with straightedge and compass to construct a square having the same 
area of the circle. 
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cos @ 


Figure 9.6 Diagram used for show that cosine of a constructible angle is also con- 
structible. 


Proof 9.13 Since the circle has a radius of length 1, the square we wish to construct 
would have to have a side of length \/x and so it would be necessary that \/7 be 
constructible. Suppose ,/m were constructible. Then x would be constructible and thus, 
by Corollary 9.4, [(Q(./7) : Q| = 2". In particular, would be algebraic over Q 
contradicting that m is, in fact, transcendental over Q. 


Before we present the next proof we point out that some angles can be trisected, 
like for instance a 180° angle can be trisected, since (as we shall see below) 60° is a 
constructible angle. What we mean by an angle @ being constructible is that one 
can construct with straightedge and compass two lines which intersect so that at least 
one of the angles between the lines is 0. 


Theorem 9.9 (Trisecting the Angle) /t is not always possible to trisect an an- 
gle using straightedge and compass. 


Proof 9.14 We prove this result by a series of claims. 
Claim 9.2 If an angle @ is constructible, then the number cos @ is constructible. 


Construct two lines which intersect to make an angle 6 and mark off on one line 
from the point of intersection a line segment of length 1. From the other end of the 
line segment drop a perpendicular onto the other line to form a triangle (Figure 9.6). 
The resulting triangle has base of length cos 0. 


Claim 9.3 60° is a constructible angle. 

Construct two lines which intersect at right angles. Mark off a length 1 on one 
line and /3 (which is constructible) on the other line. Connect the two endpoints to 
form a triangle (Figure 9.7). 

Claim 9.4 The number cos 20° is a root of the polynomial 82° — 6x — 1. 

Notice that 


1 
5 = 60s 60° = cos (40° + 20°) = cos 40° cos 20° — sin 40° sin 20° 


= (cos* 20° — sin? 20°) cos 20° — (2 sin 20° cos 20°) sin 20° 
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Figure 9.7 Diagram used for show that cos 60° is a constructible angle. 


= cos? 20° — 3(sin? 20°) cos 20° = cos® 20° — 3(1 — cos? 20°) cos 20° 
= 4cos? 20° — 3cos 20°. 


Multiplying both sides by 2 and throwing everything to one side yields 8 cos? 20° — 
6 cos 20° — 1 = 0 which proves the claim. 


Claim 9.5 [Q(cos 20°) : Q| =3 


First note that 82° —6x—1 € Q{2] is irreducible in Q|z], for otherwise 8x3 —6x—1 
would have a linear factor in Q(x] and so would have a rational root. By the Rational 
Root ee the only candidates for rational roots for 873 — 6x — 1 are +1, , 
i, 5 But none of these candidates are roots of 8x? — 6x — 1 and so it must be 
iredicibie, Therefore, by Claim 9.4, [Q(cos 20°) : Q] = 3. 

We can now show that 60° cannot be trisected using straightedge and compass. 
Suppose, to the contrary, that 60° could be trisected. Then, by Claim 9.3, 20° would be 
constructible. Then, by Claim 9.2, cos 20° would be a constructible number. However, 


Claim 9.5 and Corollary 9.4 are in direct contradiction. 


EXERCISES 


1 Prove that if a,b are two positive constructible real numbers, then so is a/b 


(when b £ 0). 


2 Prove Corollary 9.1. 


9.4 ALGEBRAIC EXTENSION & CLOSURE 


We now introduce two additional and relevant concepts to the arena, namely algebraic 
extension and algebraic closure, which we will need in our future discussions. The 
similarity in the names of our terminology and in their definitions can easily lead to 
confusion and so the reader is advised to pay careful attention to the distinctions 
made between them. 


Definition 9.6 Let F C E fields. The field E is algebraic over F if for every 
a € E is algebraic over F. The field E is then called an algebraic extension of F’. 
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Example 9.10 Q(V2) is algebraic over Q. Take any a+by/2 € Q(V2). Then atby/2 
is a zero of x? — 2ax + (a? — 2b?) € Q[a}. 


Lemma 9.4 Let FC E be fields. If |E : F] < 00, then E is algebraic over F. 


Proof 9.15 Let |E : F] = n < o and take any a € E. Since the vector space 
dimension of E over F is n, the set of n+1 vectors 1,a,a7,...,a” must be linearly 
dependent over F'. Therefore, there are scalars co, C1, C2,---,Cn © F not all zero such 
that col + cya+---+c,a" =0. But this says that a is algebraic over F since a is a 
zero of p(x) =cotcat:::+cn2” € Fla]. Since a was arbitrary in E, we have that 
E is algebraic over F. 


Theorem 9.10 Let a1, a2,...,d, € ED F be fields with E = F(a, a2,...,an). The 
following statements are equivalents: 


1. E is algebraic over F. 
2. Each a; is algebraic over F. 


3. |e FB] <o0. 


Proof 9.16 The first statement implies the second is immediate from the definition 
of E being algebraic over F. To prove the second statement implies the third, since 
each a; ts algebraic over F from previous work we know that |F(a;) : F] = di < 
co where d; is the degree of the irreducible polynomial of a; over F. Let’s call that 
polynomial p;(x) € Fa]. We show the third statement is true by induction on n, 
the number of generators of E over F. For the n = 1 case, E = F(a) and so 
[Ee B) = [Plan FP) dy ee. Porn 1, seh ie = (as, Gos os tig 7 80 that 
FCKCE=K(a,). By the induction assumption, we know that [Kk : F] < co. We 
claim that |E : K] < co and so by Theorem 9.3, [E: F] =|E: K][K : F] < 00 and 
we are done. To see this, note that py(dn) = 0 with p, € Fla] C K[ax]. Hence, ap is 
algebraic over K and so [E: K| = |K(apn): K] < ow. 
The third statement implies the first follows immediately from Lemma 9.4. 


Corollary 9.5 [f F C E C K with K algebraic over E and E algebraic over F, 
then K is algebraic over F. 


Proof 9.17 Take ana € K. By assumption, a is algebraic over E. Let p(x) = 
Cot ee +++ + e,2" € Ela] be the minimal polynomial of a over E. Consider the 
following chain of fields: 


Be By Per Cina deg Ga Sle =P (epi ga, — a): 


By assumption, each c is algebraic over F and so by Theorem 9.10, |Ey : F] < ov. 
Furthermore, |L : E,] =n < oo. Therefore, [L : E,| < oo. But then by Theorem 9.10 
again, we have a is algebraic over F. 
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Theorem 9.11 Let E be algebraic over F. The following statements are equivalent: 
1. E is finitely generated over F. 


at a ale aeo oF 


Proof 9.18 The first statement implies second follows immediately from Theo- 
rem 9.10, since by assumption each of the generators is algebraic over F. To show 
the second statement implies the first, we employ induction on n = [E : F] < ow. 
For n = 1, we’ve seen that E = F and so E = F(1) is finitely generated by 1 over 
F. Forn > 1, take anya € E'\ F which we know by assumption is algebraic over 
F and so 1 < [F(a): F] < o. Since [E : F] < 00 then [E: F(a)] < [E: FI. 
Since E is algebraic over F,, it is certainly algebraic over F(a), thus we can invoke 
induction to get that EF = F(a)(a,,a2,...,a,) for some ay,a2,...,a, € E. But then 
E = F(a,a1,...,a,%) and we are done. 


Definition 9.7 Let F C E be fields. The algebraic closure of F in E, written 
acde(F) ={aeE : a is algebraic over F}. 


We say that F is algebraically closed in F if aclg(F) = F, i.e. Ifa € E is 
algebraic over F’, thena € F.. We say that F is algebraically closed if aclz(F’) = F 
for all fields E > F. We say that E is an algebraic closure of F if E is algebraically 
closed and E is algebraic over F. We will denote an algebraic closure of F by F. 


Remark 9.5 We make several remarks concerning aclp(F). 


1. It is immediate from the definition that if E is algebraic over F’, then aclp(F’) = 
E. 


2. For any field ED F, aclg(F) is a subfield of E. To see this, Take any a,b € 
algz(F’). By Theorem 9.10, since a and b are algebraic over F', we have that 
F (a,b) is algebraic over F, i.e. every element in F(a,b) is algebraic over F. 
Since a,b € F(a,b) a field, then so are a—b,ab~' € F(a,b) and hence they are 
algebraic over F, i.e. a—b,ab~! € aclp(F). 


3. It’s immediate from its definition that aclg(F) is algebraic over F’. 


4. For any field E D> F, we have F C aclp(F), since for a € F the linear polyno- 
mial p(z) =x —a€ FI] has a as a zero. 


5. It is always the case that aclg(F’) is algebraically closed in E, and so 
aclp(acle(F)) = aclp(F). One proves this by showing that every a € E 
algebraic over aclp(F) is, in fact, algebraic over F. To see this, first set 
K = acle(F) and take a € E algebraic over kK. This means there is an 
f(x) = dn2"+-+-+a1x+a9 € K[x] which has the element a as a zero. Since each 
a; € K, this means that each a; is algebraic over F. But then by Theorem 9.10, 


[F'(ao, @1,---+,@n) : F] < co. Now certainly a is algebraic over F'(ao, a1, .--, Gn) 


Field Theory @ 237 


via the polynomial f(x) and so |F(ao,a1,...,@n)(a) : Fao, @1,.--,@n)] < oo. 
Therefore, 
[F'(ao,@1,..-,@n,a@) : F] 


=| Fay tig st 15 Gn) CO) > Flag ayes o35y)| (Flag; O16 os p Onl > FF] ees 
Now, by Theorem 9.10, a is algebraic over F. 


6. Unlike Theorem 9.10, it is not true for an arbitrary field E algebraic over F 
that we have [E': F| < oo, i.e. the converse of Lemma 9.4 is false. For example, 
set F = Q and E = aclc(Q) a field, by the first remark, algebraic over Q (called 
the field of algebraic numbers). However, we show that [E : F] = oo. To see 
this, consider the polynomial p(x) = «”—2 € Q|a] for any odd number n. Using 
Eisenstein’s Criterion, p(x) is irreducible in Q|x}|. Since p(x) is a polynomial 
of odd degree, it must have a real zero, say a € R (and soa € E). Hence, since 
p(x) irreducible with a zero of p(x) being the element a, we have that p(x) is 
the irreducible polynomial of a over Q, and so [Q(a) : Q| =n. Certainly, since 
ED Qa), it’s true that [E : Q| > [Q(a) : Q| =n for any odd number n. But 
then one must conclude that [E : Q] = co. Note that by Theorem 9.11, that it 
must be the case that aclc(Q) is not finitely generated over Q. 


Example 9.11 Here, we give several examples to illustrate the myriad of definitions 
we just introduced. 


1. Q is not algebraically closed in Q(W2) for among other reasons V2 € Q(V2)\Q 
is algebraic over Q (via the polynomial x? — 2 € Q|z]). 


2. The field, aclp(Q) = {aE R : a algebraic over Q} although algebraically closed 
in R (see Remark 9.5.5) is not algebraically closed in C, since i is algebraic over 
acly(Q) via p(x) = 27 +1, yeti € acle(Q). Hence, we conclude that aclp(Q) is 
not algebraically closed. 


&. C is not algebraic over Q. The reason for this is because every algebraic ex- 
tension of Q must be countable. Indeed, if E is algebraic over Q then every 
a € E is algebraic over Q and so every a € E is a zero of some polynomial in 
Q|a]. This defines a map from E to Q|z] which is finite-to-one (i.e. for each 
f(x) € Q|z] there are a finite number of a € E which are zeros of f) and so 
the cardinality of E is the same as the cardinality of Q|x| which is countable. 


EXERCISES 


1 Consider the fields F C K C E. Prove that FE is algebraic over F iff E is 
algebraic over K and K is algebraic over F’. 


2 Let F C E be fields with E algebraically closed. Prove that aclg(F) is alge- 
braically closed as well. 
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9.5 EXISTENCE THEOREMS 


In Section 9.4, we defined some new structures one of which was the algebraic closure 
of a field. Now there is no reason a priori to assume such structures must exist. In 
this section, we give proofs of the existence of some of the concepts we defined in the 
previous sections. For instance, when we formed the finite field Z2(a@) we made one 
assumption which should really be justified, namely that the polynomial x? + x + 1 
does indeed have a zero (which we called a) in some larger field containing Z2. The 
theorem below puts this issue to rest. 


Theorem 9.12 Let F be a field. For any f(x) € F'\a] of degree at least 1, there exists 
a field ED F with ana€ E a zero of f(x). 


Proof 9.19 Since F|x] is a UFD, we can factor f(x) as a product of irreducibles. Let 
p(x) = cax4 +-+-+e,2+c9 be one of the irreducible factors of f(x). It suffices to show 
that there exists a field E D> F with ana € E a zero of p(x). Set E = F[zx]/(p(x)) 
which is a field, since p(x) is irreducible. Consider the canonical map ¢: F + E by 
o(c) =€=c+(p(x)). One can check that this homomorphism is one-to-one (use the 
fact that p(x) has degree at least 1). Therefore, E contains an isomorphic copy of F, 
namely F = ¢(F). Therefore, we will switch to this new base field F and it suffices 
to find a zero of p(x) = tax? +++» + G24 € Fa] in an extension field of F’. It just 
so happens that E is the field we seek. Indeed, consider T = x + (p(x)) € E. Notice 
that 


P(E) = E+ +--+ 5+ = p(x) =D, 


so that & is a zero of p(x) and the theorem is proved. 


Corollary 9.6 Let F be a field. For any fi(x),...,fn(w) € Fla] all of degree at 
least 1, there exists a field E D F with ay,...,Q, € E zeros of fi(x),...,fn(2), 
respectively. 


Proof 9.20 Simply apply Theorem 9.12 n times to form a chain of extensions the 
largest of which is the field E we seek. 


Corollary 9.7 The following statements are equivalent for a field F’. 
1. F is algebraically closed 
2. Every f © F[x] of degree at least one has a zero in F. 


3. Every f € F |x] of degree at least one factors completely as a product of linear 
factors in F(z]. 


4. Every irreducible polynomial in F\a] has degree exactly one. 


Proof 9.21 The first statement implies the second follows immediately from the pre- 
vious theorem and the fact that F is algebraically closed. The second statement implies 
the third follows by a simple induction on the degree of f. Indeed, if deg(f) = 1 the 
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result follows immediately without appealing to the second statement. If deg(f) > 1 
and c € F is the given root of f, then by Corollary 7.3, f(x) = (x—c)g(x). Since the 
degree of g is less than that of f, by induction, g factors completely into linear factors 
and hence so does f. The third statement implies the fourth is immediate. Finally, 
the fourth statement implies the first, for suppose F C E a field anda € E is alge- 
braic over F’. Set p(x) to be the irreducible polynomial of a over F. By assumption, 
p(x) = bx +c for some b,c € F. But then 0 = p(a) = ba+c and soa=—b'!ce F. 


We shall postpone the proof of the existence of an extension field of an arbitrary 
field which is algebraically closed until we discuss Zorn’s Lemma in a later section. 
However, under the assumption it is true, we can prove the existence of the algebraic 
closure of an arbitrary field. 


Theorem 9.13 For any field F there exists an algebraic closure of F. 


Proof 9.22 Let E be an algebraically closed field containing F and set K = aclg(F) 
a subfield of E containing F and algebraic over F and algebraically closed in E 
(see Remark 9.5 on aclp(F’)). We claim that K is the algebraic closure of F which 
we seek. It remains to prove that K is algebraically closed. To show this we prove 
every polynomial over K of degree at least one has a zero in K and then appeal to 
Corollary 9.7. Take f(x) € K[x] with deg(f) > 1. Since E is algebraically closed we 
know, by Corollary 9.12, there is ana € E a zero for f(x). Therefore, 


a € aclp(K) = aclg(acle(F)) = acdge(F) = K. 


Definition 9.8 Let F C E be fields and f(x) € F |x] of degree at least one. Then E 
is called the splitting field of f(x) over F if for some aj, a2,...,an € E we have 
E = F(a, a2,...,@n) and f(x) = c(x — a,) (az — ag)---(x—ay), for some ce F. We 
say that f(x) splits in E. 


Example 9.12 Let F = Q and f(r) = 2° —x27+2—-1€ Qa] which has zeros 1, +i. 
Then E = Q(1,i,—-i) is a splitting field of f(x) over Q. Note that E = Q(t) and 
[E: Q| =2. 


Theorem 9.14 For every f(x) € F |x] of degree at least one, there is a splitting field 
of f(x) over F. 


Proof 9.23 We prove this by induction on the degree of f(x). When deg(f) = 1, the 
result is immediate. when deg(f) > 1, by Theorem 9.12, there is an extension field 
E, > F and ana, € E, a zero of f(x). So write f(x) = (w— ax) fila), where fi(x) € 
FE, [x] and deg(fi) < deg(f). By induction, there exists a splitting field Ey of fi(x) 
over Fy, i.e. Ey = Fy (a2,..., dn) with fi(a) = cla —a2g)---(t@—cp), C2,---,Cn € Eo 
andcé€ F. Then E = F(aj,@,...,Gn) ts the desired splitting field. 


Remark 9.6 We make several remarks about splitting fields. 


1. One can show by induction that if E is a splitting field for f(x) over F and 
deg(f) =n, then [E: F] < nl. 
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2. If E is a splitting field of f(x) over F, then E is algebraic over F' since each 
a; is algebraic over F' each being a zero of f(x) (see Theorem 9.10). 


3. By Theorem 9.7, if E is an algebraically closed field containing F, then every 
polynomial in F of degree at least one splits in E. 


EXERCISES 


1 Prove that the canonical map ¢: F > E by ¢(c) =t=c+ (p(z)) in the proof 
of Theorem 9.12 is one-to-one. 


2 Prove by induction that if F is a splitting field for f(x) over F' and deg(f) = n, 
then [E': F] < nl. 


9.6 FINITE FIELDS 


In this section, we classify completely the fields of finite order. Namely, we show that 
every finite field is of prime power order and for every prime to a power there is exists 
a unique field of that order. We first show that every field is of prime power order 
and for every prime power there exists a field of that order. We prove uniqueness at 
the end of the section. 


Theorem 9.15 I/f E is a finite field, then |E| = p” where p is a prime and n is a 
positive integer. Furthermore, the characteristic of E is p and |E : F| = n where 
F = Z, is the prime subfield of E. 


Proof 9.24 First note that char(E) 40, for otherwise E would contain an isomor- 
phic copy of Q making it infinite. Therefore, char(E) = p for some prime p and 
contains an isomorphic copy of Zp, which we will designate by F. Since E is finite, 
then certainly so is the degree, |E : F]. Setn =[E: F| and let ay,a2,...,an be a 
basis for E over F. Thus, every element of E can be written uniquely as a linear com- 
bination of a1, @2,...,@, over F. This allows us to count the elements of EF, namely 
there are exactly 


FL -[F| e+ LE] = [FI =p". 
-e_-—-—-- 
n times 


We now know that any finite field is of prime power order, however we still need 
to show the converse that for every prime power there is a finite field of that given 
order. Before we do this we need to derive some results which are invoked in the 
existence proof. 


Lemma 9.5 For any field F and irreducible polynomial p(x) € F(z], if f(x) € Fla] 
has a zero in common with p(x) in some extension field of F', then p(x) divides f (x) 


Proof 9.25 Suppose under the assumptions given in this lemma that p(x) did not 
divide f(x). Since p(x) is irreducible, it must then be the case that p(x) and f(x) are 
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relatively prime. Hence, there are polynomials g(x), h(x) € F |x] such that g(x)p(x) + 
h(x)f(a) =1. Leta € ED F be the common zero of p(x) and f(x). Plugging this in 
we get 

1 = g(a)p(a) + h(a) f(a) = g(a) 0+ h(a) -0=0, 


a contradiction. 


Lemma 9.6 I[f f(x), g(x) € F[z] are relatively prime in F |x|, then they are also 
relatively prime in E|ax] for any extension field E D F. 


Proof 9.26 We prove the contrapositive statement. If f(x), g(x) € Fa] are not rela- 
tively prime in E|x], then there is an irreducible polynomial q(x) € E|x] which divides 
both f(x) and g(x). Let a be a zero of q(x) (which we know exists by Theorem 9.12 
of Section 9.5). Then a is also a common zero of f(x) and g(x). Let p(x) € F [a] be 
an irreducible factor of g(x) having a as a zero. Then, by Lemma 9.5, p(x) divides 
f(x) and so p(x) is a common divisor of f(x) and g(x), t.e. f(a) and g(x) are not 
relatively prime in F [a]. 


We will need the concept of the formal derivative of a polynomial. 


Definition 9.9 Let F be a field and f(x) = ag+a,x+--++anx" € F [a]. The formal 
derivative of f(x), written f'(x) = a, + 2aga +--+ +nan,z"!. 


Notice that this definition of derivative conforms to the usual definition of deriva- 
tive in Calculus and as such shares the same properties that the Calculus derivative 
enjoys, namely it’s a linear operator with the same product rule, quotient rule, etc. 
Of course all these statements require verification, however for the sake of brevity we 
omit them (the ambitious reader can easily verify these statements). 


Remark 9.7 
By Corollary 9.7 of Section 9.5 we know that we can express 


flo) = ela ~ ay) +(e — aq) =e] [le a), 


for some c€ F anda; € E the zeros of f(x). Then by the product rule, 


Hence, in the UFD E[z], (a — a;) divides f'(x) for some j iff (x — aj) divides 
[[%-1 (a — a;). And this is true iff a; = a; for some i # j, or equivalently f(x) does 
tA 
not have distinct zeros. 
Lemma 9.7 Let F C E be fields and E an algebraic closure of F. If f(a) € F|a] is 
a polynomial with deg(f) =n > 1, then f has n distinct zeros in E iff f'(x) #0 
and gcd( f(x), f’(x)) =1 in F[a]. 
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Proof 9.27 Assume first that f'(x) 4 0 and gcd(f(x), f'(x)) = 1 in Fla]. By 
Lemma 9.6, gcd(f(x), f'(x)) = 1 in Elz] as well. So in particular, for all 7, we 
have that (a —a;) does not divide f'(x). By Remark 9.7, f(x) has n distinct zeros. 
To show the reverse direction we prove the contrapositive statement. Suppose that 
either f'(x) =0 or gcd( f(x), f’(x)) 41 in Fla}. If f(x) = 0, then certainly, for all 
j, we have (x —a,;) divides f'(x) which, by Remark 9.7, implies that f(x) does not 
have n distinct zeros. If gcd( f(x), f’(x)) #1 in F[x], then for some j we have that 
(x —a,) divides f'(x) which again, by Remark 9.7, implies that f(x) does not have n 
distinct zeros. 


Example 9.13 We apply Lemma 9.7 to several examples. 


1. Consider the polynomial f(x) = x? +1 € Q{z] which has three distinct zeros 
in C (which later we show is an algebraic closure of Q), since f'(x) = 3x7 4 0 


and gcd( f(x), f'(x)) =1. Indeed, the three distinct zeros are —1 and } + v3i., 


2. Consider the polynomial f(x) = x2 + 2x +1 € Q{z] which on the other hand 
does not have two distinct zeros, since f'(x) = 2x + 2 and gcd(f (x), f’(x)) = 
x+11. Indeed, —1 is a double root of f(x). 


3. Consider the polynomial f(x) = 73 +1 € Zs{x]. One might argue that as in 
the first example f(x) has three distinct zeros, however C is not an extension 
field of Z3. Indeed any extension field of Z3 would have to have characteristic 3 
in order to have Z3 as a subfield. So we have to look at this polynomial afresh. 
Notice that f'(x) = 0 in Zs|a], so in fact, by the lemma, f(x) does not have 
distinct zeros in an algebraic closure. Moreover we need not look any further 
than Z3 to find all of its zeros. Notice that in Z3[r], x? +1=(x+1). Thus, 
the element —1 or 2 € Z3 is a triple zero of x? +1. 


Having proved Lemma 9.7, we are now in a position to give the existence proof 
of a finite field for any prime power order. 


Theorem 9.16 Given a prime p and a positive integer n, there is a field of order 
nm 


p”. 
Proof 9.28 Let E be an algebraic closure of Z, and consider the polynomial f(x) = 
x?” —x € Z,|2]. Since f'(xz) = -1=p—140 and gcd( f(z), f’(x)) =1 in Z,[a], by 
Lemma 9.7, we know that f(x) has p” distinct zeros in E. Set Fyn to be the collection 
of p” zeros of f(x) in E. Note that a € Fyn is a zero of f(x) iff a?” =a. To complete 
the proof we need to show that Fyn is a subfield of E. We will rely on a straight- 
forward induction result (which we leave to the reader) which says that in a field E of 
characteristic p, for alla,b € E we have that (a+b)?” = a?" +b?". Indeed, E is a field 
of characteristic p since it contains Zp as a subfield. So take any a,b € Fy» and notice 
that (a +b)?” = a?” +0" =atb, so thata+b € Fyn and (ab)? = a?" = ab, 
so that ab € Fin. Finally, for a € Fyn and p odd, (—a)?" = —a?" = —a, so that 
—a € Fyn and for p = 2, (—a)?" =a?" =a=—a, so that again —a € Fyn. 
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To prove the uniqueness of a finite field for a given prime power order, we will 
first need to prove several lemmas. The first lemma is a fact about fields that perhaps 
you would not guess. The second lemma is reassuring in the sense that it says we can 
always construct finite fields by adjoining a zero of an irreducible polynomial to the 
base field. For instance, in Section 9.4, we constructed a field of order 4 by adjoining 
a zero of 7 + 2+ 1 to the base field Zo. 


Lemma 9.8 Any finite subgroup of the multiplicative group of a field is cyclic. 


Proof 9.29 This proof relies on the Classification of Finite Abelian Groups and the 
Fundamental Theorem of Algebra. Let G < F* where F is a field and G is finite. 
Since G is a finite abelian group, G is isomorphic to a direct sum of non-trivial cyclic 
subgroups, G = H, ® Hp ®@--- ® Hn, where |H;| divides |H;_1| for 1 = 2,3,...,n. 
Set k; = |H;| fori = 1,2,...,n. Since the k; successively divide each other this 
implies lem(k1, k2,..., kn) = kn. Therefore, for all g € G we have g*" = 1. In other 
words, every element of G is a root of the polynomial f(x) = x*» —1 € Fla]. By the 
Fundamental Theorem of Algebra, f(x) has at most ky, distinct roots and so |G| < ky. 
But ky =|Hn| < |G| and thus G = H,, and is cyclic. 


Definition 9.10 The generator of the multiplicative group of a finite field is called 
a primitive element. 


Example 9.14 Consider the finite field with four elements computed in Exam- 
ple 9.9.8, 
Zo(a) = {0,1,a,1+ a}. 


It’s easy to check that a and 1+ a are primitive elements. 


Lemma 9.9 Jf F C E are finite fields, then E = F(a) for some a € E algebraic 
over F’. 


Proof 9.30 We have just seen that the multiplicative group of a finite field is cyclic 
so set E* =< a> for some a € E. Then E = F(a), since E is the smallest field 
containing F anda. To see this, suppose K is a field containing F anda. If be E*, 
then for some positive integer k, we have b= a* € K, by closure. Since E and F are 
finite, then so is |E : F|] < co which, by Lemma 9.1 of Section 9.2, implies that a is 
algebraic over F. 


Example 9.15 Let’s form a finite field with nine elements and find generators of the 
multiplicative group associated with the field. We will start with Z3 and the polynomial 
p(x) = x2 +1 which is irreducible over Z3 (simply verify that no element of Z3 is a 
zero of p(x)). Let a be a zero of p(x) in some extension field (which we know exists 
by Theorem 9.12 of Section 9.5). Then Z3(a) is the finite field we seek. Indeed, 


Z(a) = Z[a] = {0, 1, 2, a, l+a, 2+a, 2a, 1+ 2a, 2+ 2a}. 
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Since the order of Z(a)* is eight, we know that there are $(8) = 4 generators of 
Z(a)* (i.e. primitive elements). Let’s find them. Notice that 


(1) = {1}, (2) = {1, 2}, (a) ={1, 2, a, 2a} = (2a). 


Therefore, the remaining four elements 1+a, 2+a, 1+2a, 2+ 2a must each be 
a generator of Z(a)*. 


In the proof of Theorem 9.6, we proved the existence of a finite field of order p” 
in a constructive way as the zeros of the polynomial x?” — x. The next Lemma shows 
that this is true for any finite field of order p” and so we will be closer to our desired 
uniqueness result. 


Lemma 9.10 /f E is a finite field of order p” for p prime and n a positive integer, 
then the elements of E are exactly the zeros of the polynomial x?" — x and so x?" —x 
splits in E. 


Proof 9.31 Since E* is a group under multiplication, every element raised to the 
order of that group equals the identity, i.e. for alla € E* we have a?"~! = 1 or 
equivalently for alla € E* we have a?" = a. Therefore, for alla € E we have a?" —a = 
0 which shows that every element of E is indeed a zero of x” —x. Furthermore, it must 
be the case that x?" — x = (x — a1)(x — ag) +++ (@ — dyn) where E = {a1, G2,..., Apr}. 


Theorem 9.17 Two finite fields of the same order are isomorphic. 


Proof 9.32 Let E,, Ey be two finite fields of the same order. By Theorem 9.15, 
|E,| = |E2| = p” for some prime p and positive element n and E, and E each 
contain an isomorphic copy of Zp, call it F;, with [E; : Fi] =n fori = 1,2. Without 
loss of generality, we can assume that E, and Ey are, in fact, extension fields of 
Z,. By Lemma 9.9, E; = Zp(a;) with a; algebraic over Z, for some a; € E; for 
i = 1,2. Let p(x) € Z,|x] be the irreducible polynomial for a, over Zp. As usual, 
E, = Z,(a1) & Z,[2|/(p(x)). By Lemma 9.10 and Lemma 9.5, p(x) divides x?" — x. 
By Lemma 9.10, the elements of Ey are exactly the zeros of the polynomial x?” — x. 
Therefore, some b; € Eg is a zero of p(x). This makes p(x) the irreducible polynomial 
of b; € Ey over Zp so that Z,(b;) = Z,[x]/(p(x)) = Ey. Since |E,| = |Eg| and 
Zp(b;) C Ey we must have Ey = Z,(b;) = Ey. 


For a given prime p and a positive integer n, the notation Fyn shall denote the 
unique (up to isomorphism) finite field of order p” called the Galois field of order 


a) 


We will finish this section with further results regarding finite fields. 
Lemma 9.11 /f F C E finite fields, then |E| =|F|" for some positive integer n. 


Proof 9.33 We've seen that E = F(a) for some a € E algebraic over F and |E : 
Fl =n < ov. Let a1, a2,...,dn be a basis for E over F. Since every element of E can 
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be written uniquely as linear combinations of a1, a2,...,d, over F, we can therefore 
count the elements of E, namely 


|E| = |F|-|F]----- | Fl = |F|”. 
— 
n times 


Theorem 9.18 For p prime and k,m positive integers, Fx C Fym iff k\m. 


Proof 9.34 If Fx C Fym, then by Lemma 9.11, p™ = |Fym| = |Fye|” = pe”, for 
some positive integer n, which implies that m = kn or k|m. Now suppose that k|m so 
that m = kn for some positive integer n. By Lemma 9.10, every element of a € Frym 
satisfies a?” =a and every element of b € Fx satisfies bP" = b. Notice that 


n times 
——< 


je — pp p®---p* = CaseCeee ie eka? =b, 


so that b € Fym as well. Hence, Fir © Fom. 


EXERCISES 


1 Check that a and 1+ a are primitive elements of Z2(a) in Example 9.9.3. 


2 Verify for polynomials that the formal derivative satisfies the sum, product and 
quotient rule. 


3 Let E bea field of characteristic p. 


a. Prove that p divides ( : for? =1,2,...p—1. 


b. Prove that (a+ 6)? = a? + bP using the binomial theorem. 


c. Prove by induction on n that for all a,b € E we have that (a+b)?" = a?" +b?". 


CHAPTER 10 


Galois Theory 


N THIS CHAPTER, we make a big connection between field theory and the theory 
| of groups called Galois theory. Such connections are ways in which powerful tools 
can be created for proving difficult mathematical results. The result which motivated 
Evariste Galois to develop this theory and its connections is the investigation of 
solvability by radicals. For the reader who is interested in the history of mathematics, 
Galois’ biography is quite interesting and dramatic. 

In Section 10.1, we relate fields and groups via field homomorphisms of an exten- 
sion field which fix the base field, thus arriving at Galois groups. We need Section 10.2 
so that we can compute these Galois groups in a practical way. Section 10.3 is a bit 
off the beaten path, but at the very least the reader should be aware of the important 
results arrived at in this section. In Section 10.4, we introduce the notion of a splitting 
field and prove two important results linking the Galois group and the corresponding 
field extension. In Section 10.5, we introduce separable degree and link this concept 
to the size of the corresponding Galois group. In Section 10.6, introduce the notion 
of a Galois extension as we compare the lattice of subgroups of the Galois group and 
the corresponding lattice of intermediate fields in the corresponding field extension. 
In Section 10.7, we give the promised proof of Theorem 10.7 presented in Section 10.6 
as well as prove a foundational result of Artin. In Section 10.8, we summarize our 
investigation in this chapter and prove the Fundamental Theorem of Galois Theory. 
In Section 10.9, we review Chapter 6 emphasizing the important concepts and results 
given therein regarding solvable groups. Finally, in Section 10.10, we investigate the 
notion of solvability by radicals. 


10.1. FIELD HOMOMORPHISMS 


In this section, we can begin to make an important connection between fields and 
groups. Namely, that every pair F’ C F of fields can be related to a certain collection 
of field homomorphisms which form a group under composition. Recall that a non- 
trivial field homomorphism is necessarily one-to-one and therefore any non-trivial 
finite field endomorphism is an automorphism, i.e. bijective homomorphism from a 
field to itself. We will be looking at a certain subset of field homomorphisms. 
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Definition 10.1 Let E, and E, be two fields containing the same subfield F. A field 
homomorphism @: Ey, + E, is an F-homomorphism if it is the identity map on 
F, i.e. for allc € F we have ¢(c) =c. 


Remark 10.1 Here, we make some quick observations about F-homomorphisms. 


1. We point out that any F-homomorphism between fields is also a F'-vector space 
homomorphism (i.e. linear transformation) between the two fields. We have 
actually employed this fact several times already in earlier lessons. We now give 
a formal argument that this is so. Suppose @: Ey + E2 is an F-homomorphism. 
Then for vectors a,b € E, we have ¢(a+b) = ¢(a)+¢(b), since d preserves field 
addition. And for a scalar c € F and vectora € Ey, $(ca) = ¢(c)¢(a) = ce(a), 
since @ preserves field multiplication and fixes F’. 


2. The reader should verify that the collection of F-homomorphisms from Ey to 
Ey with composition forms a group. 


Example 10.1 An important example of an Zp-automorphism of Fyn is the Frobe- 
nius automorphism defined by op(a) = a?. The collection of maps oy% : Fyn — Fyn 
by op«(a) = a? ( fork =1,2,...) are all Z,-automorphisms of Fyn. 


Remark 10.2 We make several remarks regarding the Frobenius automorphism. 


1. For any positive integer m we have that (px) = Opme, since for any a € Fy», 


m times 
kok k 
(pp)™(a) = ophoye == ope (a) = (= ((aP* PP = PPP = aP™ 
ex~__—_— 
m times 
= Opmk (a). 


2. The homomorphism Opn : Fyn —> Fon is the identity, since each element of Fyn 
is a zero of xP” — x. 


3. The inverse of Op 18 Op-x, since for any a € Fyn, 


Ops (Op-«(a)) = opx(a?) = (a? )" =a. 


4. The order of Op : Fyn — Fyn is n, since (op)" = Opn which we know to be the 
identity and if it were the case that (op)* were the identity forO<k <n, then 
that would mean every element of Fyn is a zero of x? —x and so by Lemma 9.10, 
Fyn = Fox and k =n. 


5. For k > m, the homomorphisms op%,0pm + Fyn — Fyn are equal iff k = 
m(mod n), since o, = opm iff Op(Opm)~* is the identity iff (o»)*~™ is the 


identity iff n\(k—m) iff k=m(mod n). 
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Definition 10.2 Let F C E be fields. The set of all F-automorphisms of E with 
composition, denoted by Gal(E/F’), is called the Galois group of E over F. 


Example 10.2 We can already determine some instances of elements in Galois 
groups. 


1. From our earlier remarks we see that op, 0p2,-..,0pr € Gal(Fyn/Fp). with 
(Op) — {pp O piety Opry: 


2. More generally, a, is an element of Gal(Fyn/F,«)of order n/k. Indeed, 
(o¢)°/*) = opn which is the identity in Gal(Fy»/F,«) and (o,«)™ ts the iden- 
tity in Gal(Fyn/Fyx) iff nlmk iff (n/k)|m. Hence, Gal( Fyn/F,«) contains 
the cyclic subgroup (opr) = {Opk, Tp2e,---,Tpr} with |(opx)| = n/k. 

What we especially want to investigate about these F-homomorphisms is what 

they do to the zeros of a polynomial. 


Definition 10.3 Leta, € E, and ag © Ep» be two fields containing the same subfield 
F with a, and az both algebraic over F. We say that a, and ag are conjugate over 
F if they have the same irreducible polynomial over F’. 


Example 10.3 We illustrate conjugacy with several examples. This type of conjugacy 
is not equivalent to conjugacy of complex numbers. 


1. Set Fy = Ey =C and F=Q. The elements 1+% and 1—1% are conjugate over 
Q since they are both zeros of the same irreducible polynomial x? — 2x +2 over 


Q. 


2. Set E, = Ex = C and consider the elements \/2 and iW2 which are conjugate 
over Q since the irreducible polynomial of W/2 and iv/2 over Q is x* — 2, yet 
not Consors over Q(V/2) since the irreducible polynomial of W2 over Q(V2) 
is x2 — /2 while the irreducible ine of ix/2 over Q(V2) is x2 + 4/2. In 
fact, all four elements +W2, V2 are all conjugate over Q. 


Theorem 10.1 Let a, € Ey and ag € E> be two fields containing the same subfield 
F with a, and ag both algebraic over F. The following statements are equivalent: 


1. a, and ag are conjugate over F. 
2. There exists an F-homomorphism @: F(a,) > Faz) with @(a,) = ag. 
3. There exists an F-isomorphism $: F(a,) + F(a2g) with (a1) = aa. 


Proof 10.1 To show the first statement implies the third, let p(x) be the irreducible 
polynomial over F for which both a, and az are zeros. Consider the evaluation epi- 
morphisms Va, : F|z] > Flai] = F(a) by Va,(f(x)) = f(a) fori = 1,2. Both maps 
have kernel (p(x)). Let ¢; be the map which makes F|x]/(p(x)) and F(a;) isomor- 
phic defined as ;(f(x)) = f(a;) for i = 1,2. Then the composition 2 0 ,' is an 
F-isomorphism from F(a,) to F(a) with ¢2 0 ¢;'(a1) = ¢2(%) = ap. 
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The third statement implies the second is immediate. 

To show the second statement implies the first, let 6 : F(a,) + F (ag) be an 
F-homomorphism with o(a1) = ag. Since a, is algebraic over F there exists an irre- 
ducible polynomial p(x) € Fx] for which a, is a zero. Write p(x) = coteyat+:--+cegr". 
Notice that 

0 = $(0) = $(p(a1)) = O(co + cra + ++ + eget) = 


b(co) + b(c1)b(a1) +--+ + (eg) (a1) = co + Crag +--+ + cgad = play). 


Hence, az is the zero of the same polynomial irreducible polynomial over F as ay. 


EXERCISES 


1 Given the fields FE, and Ey below each containing the base field F’ decide if a 
and 6 are conjugate over F’. 


a fy =C= ki, F=Q, a=iand b= -1. 
b E=R=H, F=Q,a=V2and b=-v?2. 
c. Ey =R=E, F = Q(v2), a= V2 and b = — v2. 


2 Verify that the collection of F-homomorphisms from FE, to Ey with composition 
forms a group. 


3 Verify that the collection of maps 0,4 : Fyn —> Fyn by ope (a) = a? (k =1,2,...) 
are all Z,-automorphisms of Fyn. 


10.2 COMPUTING GALOIS GROUPS 


In order to compute concrete Galois groups we first need some results which will help 
us characterize the elements of a Galois group. Two theorems are presented followed 
by some examples which apply these results. 


Theorem 10.2 Consider the fields F C E and let ay,d2,...,d, € E be algebraic 
over F and set K = F(aj,a2,...,an). Then each element ¢ € Gal(K/F) is com- 
pletely determined by where it sends the generators of K over F, i.e. it is sufficient 
to know the values of $(a1),$(a2),...,0(Gn) in order to compute o(a) for anyae K 
and should there be another ~ € Gal(K/F) with v(ai) = ¢(ai), 1 <i <n, then 
wv(a) = d(a) forallae K. 


Proof 10.2 The proof is by induction on the number of generators. For the base 
case we have K = F(a,) = Flay]. Take any f(ai) € F(a1) and write f(a) = 
cot cia, + +++ + cpak with each  € F. Then for ¢ € Gal(K/F) we have that 
b(f(a1)) = co + e16(a1) + +++ + c46(a1)* and thus the value of 6(f(a1)) is completely 
determined by the value of ¢(a,). For n > 1, set Ky = F(ay,a9,...,@n—1) so that 
K = Ky (an) = Ky [an] (since ay is algebraic over F it is certainly algebraic over Ky D 
F as well). Take any f(an) € Ky(an) and write f (an) = bo + ban +--+ + beak with 
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each b; € Ky. Then for ¢ € Gal(K/F) we have that $(f(an)) = o(bo) + (61) 6(an) + 
--» + O(c) O(an)*. By induction, each $(b;) is completely determined by the values of 
(a1), O(a2),...,0(An—-1) so that @(f(an)) is completely determined by the values of 
o(ar), o(a2), He O(a). 

Theorem 10.3 Let F C E be fields with f(x) € Fx] and suppose that ay, d2,...,an 


are the zeros of f(x) contained in E. Then each @ € Gal(E/F) defines a unique 
permutation of a1, d2,...,@n- 


Proof 10.3 Take ¢ € Gal(E/F). For each a; notice that ¢ maps F'(a;) into F(¢(a;)), 


since @(co + cia; +--+ + Cpa?) = co +. c1d(ai) +--+ Cnd(ai)”. Hence, we have the 
restriction @: F(a;) > F(o(a;)). By Theorem 10.1, a; and $(a;) are conjugate and 


this means that @(a;) = a; for some j € {1,2,...,n}. Now since @ is one-to-one and 
the set {a1,Q2,...,Qn} is finite, we see that the restriction @ : F(a1,a2,...,@n) > 
F(a, @2,...,@n) defines a permutation of a1, 42,...,@n (one should check that @ maps 


into F(a1,a2,...,dn)). By Theorem 10.2, each element of Gal(E/F) must define a 
distinct permutation of a,,d2,..-.,Qn.- 


Example 10.4 Using the two theorem just proved we now compute some Galois 
groups. 


1. Gal(Q(W2)/Q) is the trivial group. Indeed, W/2 is a zero of f(x) = 2? —2 € Q 
and so for any ¢ € Gal(Q(W/2)/Q), by Theorem 10.8, ¢(x/2) must be a zero of 
f(x) contained in Q(W/2). However, W/2 is the only zero of f(x) contained in 
Q(W/2) (exercise). Therefore, it must be the case that ¢(./2) = W2. Since the 
identity map also has this property, by Theorem 10.2, 6 must be the identity 
automorphism. 


2. Gal(Q(i)/Q) = Ze. Indeed, i is a zero of f(x) = x7 +1 € Qa] and the other 
zero of f(x) is —i also contained in Q(t) so that there are potentially two Q- 
automorphisms of Q(i): one sending i to i and —i to —i and the other sending 
i to —i and —i toi. We can, in fact, describe two such Q-automorphisms. Of 
course, the first would be the identity automorphism. For the second, first note 
that since i is algebraic over Q and [Q(t) : Q) = 2 that Qi) = {a+ bi 
a,b € Q} (we know that 1,i forms a basis for Q(i) over Q). Define ¢: Q(t) > 
Q(z) by d(a + bt) = a— bi. The reader should check that this indeed defines 
a automorphism of Q(i). Therefore, Gal(Q(t)/Q) is a group with exactly two 
elements and so is isomorphic to Zo. 


3. Gal(Q(V2, V3)/Q) is the Klein 4-group. Indeed, /2 is a zero of f(x) = 12-2 € 
Q[x] and the other zero of f(a) is —V/2 also contained in Q(vV2, V3) and V3 is 
a zero of g(x) = 2?—3 € Q|z] and the other zero of g(x) is —\/3 also contained 
in Q(V2, V3). By Theorem 10.8, each element of Gal(Q(V2, V3)/Q) defines 
a permutation of the zeros of f(x) and a permutation of the zeros of g(x). 
Hence, there are potentially as many as four elements in Gal(Q(V2, V3) /Q) 
corresponding to the permutations below: 


P29 Af 4/3. r/3 J/2 -V2 V3 -Vv3 
i eer ae en ae ee ey er ae 
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V2 -J/2 V3 -v3 V2 -V¥2 V3 -Vv3 
V2 -J/2 -V3 v3 }’ -J2 V2 -vV3 V3} 
We can, in fact, describe four such Q-automorphisms of Q(V/2, V3). The first 
automorphism is, of course, the identity automorphism. To describe the remain- 
ing three let’s first get a better description of Q(/2, V3). Consider the chain of 
fields 
QC Q(v2) € Av2, v3). 

Since /2 is a zero of the irreducible polynomial f(x) = 2? — 2 over Q (Eisen- 
stein), [Q(V2) : Q| = 2 and 1, V2 forms a basis for Q(V/2) over Q. Since V3 is 
a zero of the irreducible polynomial g(x) = x? —3 over Q(2) (it must be irre- 
ducible, for otherwise [Q(V2, V3) : Q(v2)] = 1 which puts V3 in Q(V2), a con- 
tradiction), [Q(V2, V3) : Q(V2)] = 2 and 1, V3 forms a basis for Q(V2, V3) 
over Q(/2). Hence, by Theorem 9.3, 


[Q(v2, V3) : Q] = [Q(v2, V3) : Q(v2)][Q(v2) : Q] = 2-2 =4. 


Furthermore, in the proof of Theorem 9.3 we see that the collection of products 
of the bases 1, /2 and 1, V3 forms a basis for Q(V2, V3) over Q. Namely, the 
basis is 1,\/2, /3, 6. Therefore, 


Q(v2, V3) = {a+ bV2+ceV3+dV6 : a,b,c,d€ QI. 
Consider the following maps from Q(V2, V3) to itself defined by 
a+bV2+cV3+dvV6 4 at+bV24+cvV3+dv6, 
a+bV2+cV34+dV6 4 a—bV24+cvV3—- dv6, 
at+bV2+ceV3+dV6 4 atbV2—-cv3—dv6, 
at+bV2+ceV3+dvV6 4 a—bV2—cV34dv6. 


The reader should check that these maps are indeed automorphisms which 


preserve addition and multiplication. Notice that these maps are permut- 
ing the zeros of f(x) and g(x) as required. Therefore, we know that 
|Gal(Q(V2, V3)/Q)| = 4. It remains to determine which of the order four 
groups this Galois group is, but this is simple. Observe that if you square each 
of the automorphisms (under composition) you get the identity automorphism 
and this property characterizes the order four group to be the Klein 4-group. 


Our next objective in the coming sections is to put a bound on the number of 
possible elements in a given Galois group. Perhaps the reader has already noticed 
from the examples that there is a relationship between the size of Gal(E/F’) and the 
value of |F : F']. We will show that, in general, |Gal(E/F)| < [E : F]. We will also 
discover conditions for when |Gal(E/F)| =|E: F). 


Example 10.5 Take E = Q(W/2) and F = Q. In Example 10.4.1, we found that 
|Gal(E/F)| = 1 while in Example 9.8.2 we found that |E : F] = 3. Hence, in this 
example |Gal(E/F)| < [E: F). 
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EXERCISES 


1 In each of the problems below, compute Gal(E/F’) as we did in Example 10.4. 
a. E=Q(vV2) and F=Q. 
b. E=Q(W2) and F=Q. 
¢. E=Q(/3) and F=Q(v2). 

2 In the proof of Theorem 10.3, verify that @ maps into F'(a1,a@2,...,@n). 

3 Verify that ¥/2 is the only zero of f(x) = x — 2 contained in Q(V2). 


4 In Example 10.4.2, verify that ¢: Q(i) — Q(t) by ¢(a + bi) = a — bi defines a 
automorphism of Q(2). 


5 In Example 10.4.3, verify that the four defined maps are indeed automorphisms 
which preserve addition and multiplication. 


10.3 APPLICATIONS OF ZORN’S LEMMA 


The main result of the next two sections, namely that the size of the Galois group 
Gal(E/F’) is bounded by the index |[E': F] requires the use of Zorn’s Lemma. We will 
assume the reader has no prior knowledge of Zorn’s Lemma and start from there. Since 
this is not a course on Set Theory, we will be brief. One of the foundational axioms 
which mathematicians assume about sets is the Axiom of Choice which basically 
states that given any infinite collection of sets one can define a rule which selects an 
element from each of those sets. It turns out that the Axiom of Choice is equivalent 
to Zorn’s Lemma, but before we can state it we need to define some terminology 
some of which you have already seen in this text. 


Definition 10.4 A relation < on a set X is a partial ordering if it satisfies the 
reflexive, anti-symmetric and transitivity properties, i.e. 


1. For alla € X, we have x <a. 
2. For alla,ye X ifx<yandy<x2, thenx=y. 
8. For alla,y,z€ X ifa<y andy < z, thena < z. 


The pair (X,<) is then called a poset (i.e. a partially ordered set). The set is called 
linearly ordered if in addition we have the following property: For all x,y © X 
eitherx <y ory <a. 


Example 10.6 Some examples of posets are the integers with < (hence, the nota- 
tion) or the collection of subsets of a set together with C. Note that the first example 
is, in fact, a linear ordering while the second, in general, is not. 


Definition 10.5 Let (X,<) be a poset. 
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1. An element x © X is a maximal element of X if for all y © X, whenever we 
have x < y it must be the case that y = x. 


2. An element x € X is an upper bound of a subset Y C X if for ally © Y, we 
have y < &. 


8. A chain in X is a collection of subsets of X which is linearly ordered by C. 


4. X is inductively ordered if every chain in X has an upperbound in X. 


We can now state Zorn’s Lemma which as we have stated is a consequence of the 
Axiom of Choice and the proof shall be omitted. 


Lemma 10.1 (Zorn) A non-empty inductively ordered poset has a maximal ele- 
ment. 


Example 10.7 We illustrate in several examples the use of Zorn’s lemma. 


1. We will use Zorn’s Lemma to show that every ring with 0 £4 1 has a maximal 
ideal. Let X be the collection of all proper ideals of R and partially order them 
by inclusion. Note that X is non-empty, since it contains the trivial ideal (here 
we need 0 # 1). If we can show X has a maximal element, then we will be 
done. By Zorn’s Lemma, it’s enough that we show every chain in X has an 
upperbound. Let C be a chain of elements in X. We show that the union of all 
the elements in the chain, let’s call it J, is the upperbound we seek. First note 
that J is indeed a proper ideal of R (and thus in X ). To see this take a,b € J. It 
must be that a € I, and b € Ig for some I,, Ig € C. Since C is linearly ordered, 
without loss of generality, let’s suppose that I, C Ip. Therefore a,b € In and so 
a—bée€I,C J. Now takeae J andr € R. Againa € I for some I €C and 
so ra,ar € I C J. What makes J proper is the fact that 1 is not contained in 
any of the elements of C (exercise) and thus is not in J as well. Furthermore, 
ICJ for all I €C and so J is the upperbound we seek. 


2. We will use Zorn’s Lemma to show that every non-trivial vector space over a 
field has a basis. Let X be the collection of all linearly independent subsets of 
a non-trivial vector space V. Note that X is non-empty, since X contains the 
singletons sets consisting of the non-zero elements of V. If we show X has a 
maximal element, then this element is the basis we seek (in linear algebra one 
can prove that a maximal linearly independent set of vectors form a basis for 
the vector space). By Zorn’s Lemma, it’s enough that we show every chain in 
X has an upperbound. Let C be a chain of elements in X. We show that the 
union of all the elements in the chain, let’s call it Y, is the upperbound we seek. 
We need to show then that Y is a linearly independent subset of V. Suppose 
there are scalars a,,d2,...,Qn, in the field and vectors v1, v2,...,Un € V such 
that ayvy + agvg + +++ +anUn = 0. Now each v; is in some linearly independent 
subset in C and since the elements of C are linearly ordered there must be some 
element of C, call it S, where all the vectors v1, v2,...,Un reside. Since S is a 
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linearly independent subset of V, it follows that a, = ag =--- = a, = 0, and 
s0 Y is a linearly independent subset of V and hence is an element of X. It is 
certainly an upperbound of C being the union of all the elements of C. 


3. We can now prove using Zorn’s Lemma that every field F has an extension 
which is algebraically closed. First, we show that F has a field extension E 
such that every polynomial f(x) € F[x] of degree at least one has a root in E. 
Let’s index the polynomials in F|a] of degree at least one by an indexing set 
1 dn-other words, P =A jfi(a) >. te Th, Let X = {a, 2-4 € I) be a set of 
indeterminates indexed by the same set I. Define R = F|X]| be the collection 
of polynomial in a finite number of indeterminates coming from X. We leave it 
as an exercise for the reader to verify that R is an integral domain. Set J equal 
to the ideal in R generated by P. 


Claim 10.1 J is a proper ideal in R. 


Suppose to the contrary that J = R. Then, in particular, 1 € J and so 
n 
LS SS Gees -++;Lijm,)filti), where each g; € R. 
i=1 


By Corollary 9.6, there is a field K D F with ay,...,d, € K zeros of 
fi(vi),.--,fn(a1), respectively. In the expression representing 1 above evalu- 
ate each x; at a; and any other variables evaluate at 0 to get 


n 


l= S- c fi(aj) = S- c,-0=0, a contradiction. 


i=1 i=1 


Thus, having proved the Claim, using Zorn’s Lemma in a similar manner to the 
first ecample there exists a maximal ideal M of R containing J. Now E = R/M 
is a field and the map 6: F > R/M by o(c) =t¢=c4+M is a field monomor- 
phism (recall all non-trivial field homomorphisms are monomorphisms). There- 
fore, E contains an isomorphic copy of F, namely F = ¢(F). Therefore, we will 
switch to this new base field F and look for zeros as we did in Theorem 9.12. 
Note that since each f;(x;) € M this implies that % is a root of fi(x;), where 
fils) = Card +--+ Ga; + % when fila) = cat? +--+ +12; + co. Hence, E 
is the field we seek. 

To show the existence of an algebraically closed extension of F’, by Corollary 9.7, 
it’s enough to construct a field extension E of F such that every polynomial 
f(x) € E[x] of degree at least one has a root in E. To do the, define a chain 
of fields F = Fo C Fi, C Fy C --- such that for each n € N and every 
fa(z) € Fy, [a] there is a root of fr(x) in Fr4i (which we know we can define 
by the work just completed). Now set E = Uncen Fn and E is the field we seek. 
Indeed, take any f(x) € Ela]. Then the finite number of coefficients in f(x) 
must lie in some F,, and so f(x) € F,,|x]. By our construction, f(x) has a root 
in Frat C E. 
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In preparation for our next result, we will require Zorn’s Lemma to prove a 
preliminary result on the way toward this result. But before we do this preliminary 
result, we need a more preliminary result in order to prove the preliminary result! 
First, we introduce some terminology and notation. 


Definition 10.6 Let X1, X2,Y1, Yo be sets and let f : X; > Y, and gq: X2 > Yo be 
functions. We say that g extends f (or g is an extension of f) if X; C X2 and 
for all x € X1 we have that g(x) = f(z). 


Let ¢: Fy > F) bea field homomorphism and f(x) = cotca+:+-+¢n2" € Fy [a]. 
We will denote ¢(co) + (c1)a +-+++ 0(cn)x” by the notation ¢(f(x)). 


Lemma 10.2 Consider the fields Fi C E, and Fy C Ep and let @: Fi > F» be 
a field isomorphism. Take an a € E, algebraic over F, with irreducible polynomial 
p(x) € Fi[x] of a over Fy. Set q(x) = o(p(x)) € Fala]. For each zero b € Ey of q(x), 
there exists a unique isomorphic extension of ¢ from F\(a) to F2(b). 


Proof 10.4 Let d = deg(p) = [Fi(a) : Fi]. By Theorem 9.4, 1,a,...,a% 1 forms 
a basis for F(a) over F,. In other words, every element of F(a) can be written 
uniquely as C9 +cyat+++-+cg_1a*" for each c; € Fy. Since ¢ is an isomorphism q(x) 
is irreducible of the same degree as p(x) (check) and so 1,b,...,b4! forms a basis 
for Fo(b) over Fy. Define d: F\(a) > Fo(b) by 


O(co +CEa+te+++ Cicga) = $(co) + o(c1)b feet b(ca_-1)b* 1. 
We leave it to the reader to check that this ts the desired extension. 


Corollary 10.1 Let FC E be fields with E algebraic over F. Ifa € E and p(x) € 
F'|x] is the irreducible polynomial of a over F’, then the number of F-homomorphisms 
of E into F equals the number of distinct zeros of p(x). 


Proof 10.5 By Lemma 10.2, the number of F-homomorphisms from E into F is 
at least as many as the number of distinct zeros of p(x) and by Theorem 10.1 any 
F-homomorphism from E, into F must send a to a zero of p(x). Hence, the number 
of F-homomorphisms of E, into F is exactly the number of distinct zeros of p(x). 


Lemma 10.3 Consider the fields F C K C Ey and F C Ey with E; algebraic over 
F and Ey algebraically closed. Then any F-homomorphism @: K — Ey, extends to 
an F-homomorphism from E, to E4. 


Proof 10.6 Set X equal to the collection of F-homomorphisms from L to Ey, where 
Kk CLC EF, and order them by extension, i.e. 6<w iff w extends d. It is easy 
to see that this is a partial ordering. The set X is non-empty, since the original @ 
in our assumptions is in X. Finally, to apply Zorn’s Lemma, we need to show that 
X is inductively ordered. We will show that for any chain C in X, the union of the 
elements of C, call it x, is an upperbound for C in X. There are several things to 
check. First, we need to point out that y is a well-defined map, since C by definition 
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is linearly ordered. Second, x is in X since its domain contains K and it extends 
the original map @: K > Ey. Finally, x is an upperbound for C since it certainly 
extends all the elements in C. 

Therefore, by Zorn’s Lemma, there is an F-homomorphism @: E —> Ey extending 
all the elements of X with K C E C Ey. We show now that, in fact, E = E,. Suppose 
to the contrary we had ana € E, \ E. Since EF, is algebraic over F', we have that 
a is algebraic over F and so a is algebraic over E as well. Let p(x) € Elz] be the 
irreducible polynomial for a over E. Set q(x) = $(p(x)) € Ea[x]. Since d(a) is a 
zero of q(x), by Lemma 10.2, @ can be extended to a F-homomorphism from E(a) to 
@(E)(0(a)) C Ey. But this would contradict the maximality of ¢. 


EXERCISES 


1 Explain why in Example 10.7.1 that 1 is not contained in any of the elements 
of C. 


2 Verify in Example 10.7.3 that R is an integral domain. 
3 Verify in Lemma 10.2 that q(x) is irreducible of the same degree as p(z). 
4 Verify in Lemma 10.2 that ¢ is the desired extension. 


5 Verify in Lemma 10.3 that < defined in the proof is a partial ordering. 


10.4 TWO IMPORTANT THEOREMS 


In this section, we will achieve the goal of putting a bound on the size of the Galois 
group, namely that |Gal(E/F)| < [E : F]. We will also work toward the goal of 
characterizing when the inequality above is an equality. 


Theorem 10.4 Consider the fields F C E C F where F is an algebraic closure of 
F and [E: F] < oo. Then the number of F-homomorphisms from E to F is no more 
than [E: F). 


Proof 10.7 The proof is by induction onn = |E: F). Ifn=1, then E = F and so 
the only F-homomorphism is the identity map. Forn > 1 pick anya € E\ F and set 
E, = F(a). We first show that the number of F-homomorphisms from E, into F is no 
more than [E, : F'). To see this, first note that a is algebraic over F since |E: F] < o. 
Indeed, by Lemma 9.4, all of E is algebraic over F’. Let p(x) € F |x] be the irreducible 
polynomial of a over F. By Corollary 10.1, the number of F-homomorphisms from 
E, into F equals the number of distinct zeros of p(x) < deg(p) = [Ey : F}. 

Since [FE : E\] = [BE : Fl/[E, : F] < [E : FI, by induction, the number of 
E,-homomorphisms from E into Ey (check that E, is an algebraic closure of F) 
is no more than |E : Ey]. Let’s enumerate all the homomorphisms that we have 
found so far. Denote $1, 2,...,% to be the F-homomorphisms from E, into F with 
k < [E, : Fl. Denote v1, v2,.-.,Um to be the E,-homomorphisms from E into F 
with m < [E : E,]. By Lemma 10.3, each ¢; extends to 6; an F-homomorphism 
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from F into F. Notice that each composition ¢; © w; is an F-homomorphism from E 
into F and the number of such compositions isk+m < [E,: FI[E: Ei] =[E: F). 
Therefore, to complete the proof it suffices to show that any F-homomorphism from 
E into F is equal to one of the compositions $0; for some i and j. To this end, let 
¢:E > F be any F-homomorphism. Notice that the restriction of ¢ to E,, denoted 
as ¢ | E,, must equal 6; for some 1 < i < k. Consider the composition (¢;)~' 0 @ 
a homomorphism from E into F. Since d | Ey = ¢; and ¢; | Ey = 4j, for all 
be Ey we have o(b) = ¢;(b) or (¢)~1((b)) = b. Thus (¢;)~! 0 fixes E, and hence 
is an E,-homomorphism. Therefore, (¢;)~! 0 @ = y; for some 1 <j < m, and so 


b= $0 Y;. 


Corollary 10.2 For any fields F C E with [E: F] < co we have that |Gal(E/F)| < 
[E: Fi. 


Proof 10.8 This result follows immediately from the observation that E C F and so 
every F-automorphism from E onto E is an F-homomorphism from E into F. 


Example 10.8 Let’s return to the case of finite fields. We saw that 


(Opk) = {Opk, Tp2k,--., pr} C Gal( Fyn /F, x) 


Pp ’ 
with |(o,«)| = n/k so that |Gal(Fy»/F,«)| > n/k. Furthermore, since [Fyn : Fir] = 
n/k, by Corollary 10.2, |Gal(Fyn/F,x)| < n/k. Hence, |Gal( Fyn /F,«)| = n/k with 
Gal(Fyn/Fye) = (px). One consequence of this example is that for F C E finite 
fields, the Galois group Gal(E/F’) is always cyclic. 


Our next goal is to characterize the case when |Gal(E/F)| = [E: F]. The next 
result will be helpful toward that goal. 


Theorem 10.5 Let F C E C F with E algebraic over F and F an algebraic closure 
of E and F. Consider the following statements: 


1. If¢: E> F is an F-homomorphism, then ¢(E) C E. 

2. Ifo: E> F is an F-homomorphism, then ¢(E) = E. 

3. If f(x) € Fla] has a zero in E, then f(x) has all its zeros in E. 
4. E is a splitting field of some f(x) € F{z}. 


Then the first three statements are equivalent, while the fourth statement implies 
the first three statements. In addition, the first three statements imply the fourth 
statement under the condition that |E : F| < oo. 


Proof 10.9 In order to show the first three statements are equivalent, we first as- 
sume that the first statement is true and we show the second follows. Take an F- 
homomorphism @: E + E and we need to show ¢ maps onto E. Take any b € E 
and set 

X={b'€E : bandl are conjugate}. 
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Since b is algebraic over F it is the zero of some irreducible polynomial p(x) € Fz] 
with |X| < deg(p) = |F(b) : F] < oo. So we can enumerate X = {bi,b2,...,bn} 
and set Ey = F'(bi,b2,...,bn). Since each b; is algebraic over F', by Theorem 9.10, 
[E, : F| < oo. Since o(X) C X this implies that o(£1) C E, (see Theorem 10.2 
and Theorem 10.3). Set ¢1 to be the restriction of 6 to Ey. As mentioned earlier we 
can view an F-homomorphism like 6, as an F-vector space homomorphism. Since 
¢, is one-to-one with o,(F,) C E, and the vector space dimension of Ey over F is 
finite, it follows that 6, maps onto Ey. This means there is ana € Ey C E such that 
gi(a) = b and so ¢(a) = b which makes @ map onto E. 

Now assume that the second statement is true to prove the third is true. Without 
loss of generality, we can assume the polynomial f(x) € F\a] is irreducible over F’. 
Leta € E be a zero of f(x). Take any b € F any other zero of f(x). By Theorem 10.1, 
there is an F-homomorphism @ : F(a) + F(b) with ¢(a) = 6. By Lemma 10.3, we 
can extend @ tod: E> F. By assumption, ¢(E) = E and so b= (a) = d(a) € E. 

Now assume the third statement is true to prove the first is true. Let 6: E> F 
be an F-homomorphism and take a € E. Since E is algebraic over F this means that 
a is a zero of some irreducible polynomial p(x) € Fla]. Since o(a) is conjugate to a 
over F’, by assumption, ¢(a) € E and so ¢(E) C E. 

Now we show that the fourth statement implies the first (and thus, the second and 
third as well). By assumption, E = F(a1,a2,...,@n) where X = {a1,a2,...,an} are 
the zeros of some polynomial f(x) € Fla]. Let ¢: E 4 F be any F-homomorphism. 
Since each (a;) is a zero of f(x) we have that o(X) C X and so ¢(E) C E. 

Finally, we show the third statement (and thus, the first two as well) imply the 
fourth statement in the case that |E : F] < oo. Since |E : F] < oo, by Theorem 9.11, 
E = F(a1,q@2,...,@n) for some a1,02,...,4n € E. Since each a; is algebraic over 
F, each has an irreducible polynomial p;(x) over F. Set f(x) = pi(x)po(x) +++ pn(x) 
and let X denote the (finite number of) zeros of f(x). By assumption, X C E and 
since F(X) is the smallest field containing F and X, we have F(X) C E. Since 
E = F(ay, d2,...,@n) we also have E C F(X), and so E = F(X) the splitting field 
of f(a) over F. 


Definition 10.7 Let F C E C F with E algebraic over F and F an algebraic closure 
of E and F. If the first three statements in Theorem 10.5 hold for E, we say E is a 
normal eztension of F. 


As Theorem 10.5 attests, in the case that [E.: F] < oo, E being a splitting field 
of some polynomial in F’ is equivalent to E being a normal extension of F’. 


EXERCISES 


1 In the proof of Theorem 10.4 verify that FE, is an algebraic closure of F. 


2 Let PF C K CE be fields with EF a finite normal extension of K and K a finite 
normal extension of F’. 


a. Prove that if Gal(E£/F’) is abelian, then so are Gal(E/K) and Gal(K/F). 
b. Prove that if Gal(E/F) is cyclic, then so are Gal(E/K) and Gal(K/F). 
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10.5 SEPARABLE DEGREE 


As we saw in Section 10.4, counting the number of F-homomorphisms from an exten- 
sion field E of F' into an algebraic closure of F’ is the main quantity we want to pin 
down in order to get results about the size of Gal(E/F’). For this reason we introduce 
the following special notation and terminology. 


Definition 10.8 Let F C E be fields. The separable degree of FE over F, written 
[E: F]s, is the number of F-homomorphisms from E into E. 


Remark 10.3 We now tie separable degree into the previous notions we have thus 


far. 


d. 


2. 


In general, |Gal(E'/F)| < [E: Fs. 


For E algebraic over F,, any algebraic closure E of E also serves as an algebraic 
closure of F, since E is algebraic over E and E is algebraic over F. 


. The algebraic closure of F is unique up to isomorphism. Indeed, suppose E, and 


Ey are both algebraic closures of a field F. By Lemma 10.3, we can extend the 
inclusion map from F into E, to an F'-homomorphism (necessarily one-to-one) 
@ from E, to E,. Finally, we need to show that ¢ maps onto Ey (and hence Fy 
and E, are isomorphic). Set E = $(£1), an algebraically closed field (since Ey 
is algebraically closed — exercise). Take any a € E2. Since E2 is algebraic over 
F, the element a is algebraic over F. Since E is algebraically closed, it must be 
the case thata € E. Hence, E = Ey and ¢ maps onto E>. 


. From the previous remark, it follows that the value of |E : F|, is the same 


regardless of what algebraic closure of F we use (exercise). 


. Theorem 10.4 can be rephrased as follows: If |E : F| < oo, then |[E: Fl, < 


[E: Fi. 


. If E is a normal extension of F, then by Theorem 10.5.2, it follows that 


|Gal(E/F)| =([E: Fle. 


If |E : Fl] < o, then by Theorem 10.5, E is a normal extension of F 


iff |Gal(E/F)| = |E: F],. Note that for the direction in which we assume 
|Gal(E/F)| =|E: F],, we need |E : F] < co in order to have E algebraic over 
F and so, by the first remark, E serves as an algebraic closure of F. Hence, 
considering F-homomorphisms from E into E is the same as considering them 
from E into F. 


. The second paragraph in the proof of Theorem 10.4 (with a bit of thought) 


essentially gives the result that [E : Fl, = [E.: K].[K : F]s for any fields 
F CK CE under the assumption that [E : F] < oo. 
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Example 10.9 We illustrate separable degree with an example. 


1. 


In preparation for the second example we point out that outside of every field 
F there exists a transcendental element. Indeed, Set E = F(x) D F where x is 
an indeterminate not mentioned in F. Then x is transcendental over F, since 
if it were algebraic over F' there would be a polynomial f(x) € Fx] of degree at 
least one with f(x) =0 (here we have evaluated f(x) at x = x). But then f(x) 
would be the zero-polynomial, a contradiction. 


. We give an example of the case when |E: Fs, is strictly smaller than [E : F). 


Consider the field Z, and set F = Z,(t) where t is transcendental over Zp. 
Notice that p(x) = a? —t € F[a| has no zero in F, for otherwise t would be 
algebraic over Z,. Indeed, suppose that f(t)/g(t) € F were a zero of p(x). Then 
f(t)? /g(t)? —t = 0 which implies that f(t)? — tg(t)? = 0 and so t would be a 
zero of the polynomial f(x)? — xg(x)? € Z,[z]. Choose any zero a € F of p(z) 
and set E = F(a) C F. Since char(F) = p, we have that (x — a)P = x? — a? = 
x? —t = p(x). Thus, a is a zero of p(x) of multiplicity p and hence the only 
zero of p(x). Now this implies that any F-homomorphism from E into E must 
send a to a and thus this map must be the identity homomorphism. In sum, 
[E : F], = 1. Since p(x) is the irreducible polynomial for a over F (p(x) has 
no zero in F'), we have that |E : F] =p and so [E: Fl, < [E: F]. 


We now introduce some more terminology using the same word separable and 
show how the two notions are connected. 


Definition 10.9 Let f(x) € F[z] andaé F be a zero of f(z). 


1. 


The element a is a simple zero of f(x) if (a —a)| f(x), yet it’s not the case 
that (x —a)?| f(z). 


. An irreducible polynomial in F |x| is separable over F if all its zeros are 


simple. 


. An arbitrary polynomial in F'\x| is separable over F' if all its irreducible 


factors are separable over F. 


. An element a algebraic over F' is separable over F if its irreducible polynomial 


is separable over F’. 


. A field E is separable over F if E is algebraic over F and each element of E 


is separable over F’. 


Theorem 10.6 Consider the field F C E with [E: F| < oo. E is separable over F 
if |B | = eek]: 


Proof 10.10 First assume that E is separable over F. We show that |E:: F], = |E: 
F| by induction on n = [E': F]. In the case that n = 1 we get that E = F and 
so the only F-homomorphism from E into E is the identity homomorphism. Hence, 
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[E: Fl], =1=[E: F). Forn >1, choose ana € E\F and set K = F(a) and let p(x) 
be the irreducible polynomial for a over F. By induction, |E : K], = |E : K] (since 
E separable over F implies that E separable over K — the irreducible polynomial of a 
over K divides the irreducible polynomial of a over F'). Since E is separable over F 
the irreducible polynomial p(x) has only simple roots and hence has as many zeros as 
its degree. As we have seen, for each zero of p(x) there is an F-homomorphism from 
K into K which sends a to some zero of p(x). Hence, [K : F\; equals the number of 
zeros of p(x) which equals deg(p) = |K : F] and so 


(Bar) = (6: KK Se klk cP l= (EF ls: 


Now assume that |E: F), =|E: F| and we show E is separable over F. Take any 
a € E and set K = F(a). By our earlier remarks, we know that |[E: K|, < [E: K] 
and |K : F], <|K: F]. But since 


[Bi Fle=|2: KK: Fl,< [22 A|[K > FF] =(2 2 FI, 


and [|E': Fl], = [E: F], then it must be the case that, in fact, [E : K], = [E: K| 
and [Kk : F|, = [K : F]. In particular, the fact that |K : F], = [K : F] says that 
the irreducible polynomial for a over F' has as many conjugates as the degree of p(x). 
But that could only mean that all the zeros of p(x) are simple. 


Hence, we have now reached the main goal of the last few sections. 


Corollary 10.3 Let F C E be fields with [E : F| < co. Then |Gal(E/F)| =|E: F] 


iff E is both a normal and separable extension of F’. 


Proof 10.11 Since [E : F] < «, we know that |Gal(E/F)| < [E: Fl, < [E: F. 
First assume that |Gal(E/F)| = [E : F]. By our work above, we have that [E : 
F\, = [E.: FI, which by Theorem 10.6, implies that E is separable over F and 
|Gal(E/F)| = |E: Fs, by an earlier remark implies that E is normal over F'. Now 
assume that E is both normal and separable over F’. Citing the same references in the 
previous two lines we get that |Gal(E/F)| = |E: F], and |E: F], =([E : F| which 
implies that |Gal(E/F)| = [E: F]. 


EXERCISES 


1 In Remark 10.3.3, verify that E) is algebraically closed. 
2 Verify the statement made in Remark 10.3.4. 


3 Verify the statement made in Remark 10.3.8. 


10.6 GALOIS EXTENSIONS 


Our next major goal is the Fundamental Theorem of Galois Theory which relates 
the lattice of subgroups of Gal(E/F’) to a lattice of certain intermediate subfields 
between F and F’. A key assumption of this result is that E’ be a Galois extension of 
F.. In this section, we will define and explore this idea. 
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Definition 10.10 Let F C E be fields. 


1. For any field E let Aut(E) denote the group of automorphisms of E with com- 
position. 


2. For any field E and any subgroup G < Aut(E), the G-invariant subfield of E, 
denoted by EG, is the collection of all elements of E which are fixed by every 
element of G, i.e. 


ES ={a€ E| ¢(a)=a forall ¢€ G}. 


Remark 10.4 1. One can easily check that E® is a subfield of E. (exercise) 


2. If G < Gal(E/F), then F C E© C E. In general, if K is a subfield of E 
containing F, then K is called an intermediate subfield of E over F’. 


Example 10.10 Here, we give some examples of G-invariant subfields. 


1. Set E=Q(V2) and F =Q. Since, by Theorem 10.3, any element of Gal(E/F) 
is completely determined by where it sends the zeros of x* — 2 contained in E, 
then |Gal(E/F)| = 2 and consists of the identity homomorphism and the F- 
homomorphism sending W2 to —W2. Call this second map ¢. Since «4 — 2 is 
the irreducible polynomial for W/2 over F (Eisenstein), by Theorem 9.4, a basis 
for E over F is 1, 72, (W2)? = V2, (W2)3. Hence, the elements of E have the 


form 
Co 4 v2 coV2 t c3(W/2)°, 


where Co, C1, €2,¢3 € F. Let G = Gal(E/F) and we compute E@ in this setting. 
Since the identity homomorphism fixes everything, to find E© it suffices to find 
out what the second element of G fixes. First note that 


(V2) = of V2?) = 00/2)? = (-V2)? = V2 and 


olV2] = o(2)? = (-V2)? = 7. 
Therefore, ¢ fixes an element of E iff 


(co + 12 + coV2 + €3(72)8) = co + 1 V2 + CoV2 + 03(V2)? iff 
co + c10(V2) + c29(V2) + c36((W2)?) = co + V2 + coV2 + c3(V2)? iff 
co — 1 V2 + V2 c3/2° = c + V2 + coV2 + 3(W2)? iff 


cy = —c, and cz = —C3, te. c, = C3 = 0. Hence, the elements of E fixed by @ 
look like cy + coV/2, which is precisely the elements of Q(./2). In other words, 
ES = Q(v2). 
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2. Recall our work investigating the Galois group of E = Q(V2, V3) over F = Q. 
We described a typical element of E as having the form atbV2+cV/3+dy6 and 
we found Gal(E/F’) to be the Klein-4 group having the following four elements: 


a+bV2+cev3+dV6 + a+bV2+cv3+ dv6, 


atbV2+cV34+dvV6 4 a—bV2+cvV3 -dvé6, 
a+bV24+cV34+dV6 4 at+bV2—-cv3- dv6, 
abV2 o/s +dv6 . 4—bV2 = 0/3 dvb. 


Let’s name these homomorphisms as 1, ¢1, 2 and $3, respectively. Consider 
the following subgroups of Gal(E/F): Go = {1}, Gi = {1, ¢1}, Go = {1,42} 
and G = Gal(E/F). First, let’s compute E@ which is dependent on what $1 
fixes. Notice that 


or(a + bV2 + cV3 + dV6) = a+bV2+cV3+4+dv6 iff 


a— bV2+ceV3 —dV¥6=at+bV2+cV34+dvV6 iff 


b=d=0 and so E™ = Q(vV3). Ina similar way we get that E®? = Q(V2). 
Furthermore, since elements of E° must be fixed by both ¢, and $2 we get that 
ES = F. Finally, E© is certainly E. 


3. Refer to Example 10.9 where F = Z,(t) with t transcendental over Z, and 
E = F(a) where a € F \ F was the only zero of x? — t. We saw that |E : 
F), =1 and so |Gal(E/F)| = 1 which means Gal(E/F) has only the identity 
homomorphism. Thus, if we set G = Gal(E/F), then ES = E, the whole field. 


Amazingly enough, there is a connection between the Gal(£/F)-invariant sub- 
fields of E and the concepts of normality and separability. Before we can state this 
result we need another definition. 


Definition 10.11 Let F C E be fields and set G = Gal(E/F). The field E is said 
to be Galois over F if E° = F. We also say that E is a Galois extension of F. 


Example 10.11 Referring to the previous three examples just presented, 
1. E=Q(¥2) is not Galois over F = Q, since ES = Q(V2) F F. 
2. E=Q(V2, V3) is indeed Galois over F = Q, since ES = F. 
3. E=Z,(t)(a) is not Galois over F = Z,(t), since Eo = EF F. 


So the question is what makes some field extension Galois while others are not. 
The following theorem, whose proof we present in the next section, gives the reason 
in the nutshell: 


Theorem 10.7 Let FC E be fields with |E : F) < oo. The field E is Galois over F 
iff E is both normal and separable over F. 
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Example 10.12 Referring again to the previous three examples, 


1. The reason why E = Q(W2) fails to be Galois over F = Q is that E is not nor- 
mal over F. Indeed, not every zero of x*—2 is contained in E as Theorem 10.5.3 


requires. 


2. The above theorem is confirmed by the fact that E = Q(V/2, V3) is Galois over 
F=Q. Indeed, since |Gal(E/F)|=4=[|E: F], by Corollary 10.3, this implies 
that E is both normal and separable over F 


3. The reason why E = Z,(t)(a) fails to be Galois over F = Z,(t) is that E is not 
separable over F’. Indeed, we computed [E : F], =1Ap=|E: F| which by 
Theorem 10.6 shows that E is not separable over F.. 


EXERCISES 
1 Let E = Q(¥2,¢) where ¢ = —1 + ¥8i. Let F = Q and G = Gal(E/F). 


a. 


b. 


Verify that ¢? = 1 and the three roots of x? — 2 are V2, CW2 and (7 V/2. 
Find an irreducible polynomial having ¢ as a root. 


Compute [E : F] and use this to put a bound on the number of elements in 


G. 


Now compute the size of G by listing an associated permutation for each 
element of G. 


Explain why G is not abelian. 


Give a nice description of the elements of FE as F-linear combinations of a 
specific basis. 


Compute the invariant subfield EB” for any subgroup H of order two of your 
choosing. 


2 Verify Remark 10.4.1. 


10.7 SOME PRELIMINARY THEOREMS 


There are two results to prove in this section. The second result is Artin’s Lemma and 


has a surprisingly simple proof based on elementary ideas from linear algebra. But 
first, as promised, we provide the proof of Theorem 10.7 mentioned in the Section 10.6. 


Proof 10.12 First assume that E is both normal and separable over F with |E : 
F] < oo. Set G = Gal(E/F) and K = E® and we will show that K = F and 
so E' is Galois over F. To see this, note that by Theorem 10.5.4, E. is the splitting 
field over F of some polynomial f(x) € F\a] and by Corollary 10.8, |Gal(E/F)| = 
[E : F). Now since F C K, it is also the case that E is the splitting field over K 
of the same f(x) € K[a]. Again, by Theorem 10.5, E is normal over K as well. 
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Since FE is separable over F it is separable over the intermediate field K as well (the 
justification is identical to a portion of second half of the proof of Theorem 10.6). 
Now, by Corollary 10.3, we have that |Gal(E/K)| = [E: K]. Observe since K = E® 
it is the case that Gal(E/F) = Gal(E/K) and so |E: F] =|E: K]. But this implies 
that |K: F] =1landsok=F. 

Now we assume EC = F with [E : F] < co. Take anya € E and let p(x) € F[z] 
be the irreducible polynomial of a over F. To show that E is both normal and separable 
over F’, it is enough to show that p(x) has all its zeros in E and they are all simple 
zeros. Let 


G = {¢1, b2,---,¢n} and set X = {d;(a) | 1<i<n}, 


which consists of the zeros of p(x) contained in E. Let’s enumerate X = 
{a1,@2,...,a~} where k < n. Note that a € X, since the identity homomorphism 
is in G. Set q(x) = T[_,(a—a;) € Ela]. Certainly, by how q(x) is defined, it has only 
simple zeros with all its zeros in E, so in order to show that p(x) has all its zeros in 
E and they are all simple zeros, it’s enough to show that p(x) and q(x) differ only by 
a constant in F. To prove this we will show that q(x) € F|a], for if this were so, then 
p\q (Lemma 9.2.1) and since q\p and p(x) is irreducible, we would then have that p(x) 
and q(x) differ only by a constant in F. Now in order to show that q(x) € F\a], we 
set q(x) = bo tba +--+ +bpa* and show that each b; € ES (which by assumption 
implies each b; € F and the result is proved). To this end, take any ¢; € G. On the 
one hand, 


0; (q(a)) = (bo) + 3 (bi)x ate Beh ob; (b) x”. 


On the other hand, 
k 


(q(x) = [[ (x — $;(a:)), 


i=1 
which follows from the definition of polynomial multiplication and properties of a field 


homomorphism. Since, by Theorem 10.3, each @; € G defines a permutation of the 
elements of X, we have that 


k k 


$;(4(x)) = |] (@ — 6;(a)) = []@ — a) = a(2). 


i=1 i=1 

But then 

which implies that $;(b;) = ); fori =0,1,...,k. Since @; was chosen arbitrarily in 
G it follows then that each b; € E@. 


Lemma 10.4 (Artin) Let F C E be fields with |E : F| < oo. If H < Gal(E/F) 
such that E¥ = F, then [E: F] < |H]. 
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Proof 10.13 Set H = {1,¢2,...,¢n} with ¢1 being the identity homomorphism 
on EF. Let a,,a2,..-,Q4m be a basis for E over F and suppose, to the contrary, that 
m>n. Consider the following homogeneous system of equations: 


$1(a1)%1 + b1(a2)£2 + +++ + 1(Gm)&m = 0 
2(a1)%1 + b2(a2)%2 + +++ + G2(Gm)%m = 0 


sea aetna 6 


Since m > n, the system of equations has non-trivial solutions in E'. Let 
(b1, b2,...,bm) € E™ be a non-trivial solution containing a maximal number of ze- 
ros. Without loss of generality we can assume that b) 4 0 (by simply reordering 
the ¢;’s). Furthermore, without loss of generality, we can assume that b, = 1, since 
(by; ‘by, by 'bz,..., by bm) is also a solution to the system. 


Claim 10.2 Each b; € E™. 


Suppose, to the contrary, the Claim were false. Without loss of generality, we 
can assume by € E” (again, by reordering) so that ¢;(b2) 4 b2 for some j. Since 


(1, bg,...,m) ts a solution to the system, we have 
1 (a1) + b1(a2)b2 + +++ + b1(am)bm = 9 
b2(a1) + b2(a2)b2 + +++ + 62(Gm)bm = 0 
n(a1) + bn(d2)by +++ bn (din)Bm = 0 
Now apply that particular @; to each equation in the system to get 
(1 (a1)) + O;(1(a2))b5(b2) + +++ + Oj (b1(Gm)) Oj (bm) = 0 
j(b2(a1)) + 5 (G2(a2))b5(b2) + +++ + 65 (G2(4m)) 0; (bm) = 0 


PACU ES CR CO RCA ee CRC RIENCE Ee 


Since, by Cayley’s Theorem, 6; permutes the elements of H, by reordering the 
equations we can obtain 


b1(a1) + b1(a2)b;(b2) + +++ + $1(Gm) bj (bm) = 0 
b2(a1) + G2(a2)bj(b2) + +++ + $2(am) Gj (bm) = 0 
n(a1) + bn(a42)0j(b2) + +++ + bnaldn)®;(Pm) = 0 


This implies that (1, 6;(b2),...,@;(bm)) is another solution to the original system 
of equations. But then the difference, 


(1, 095025 bm) — (1,07 (62), 25:7 (bm)) = (0, ba — 07(63),..+., bm =O (Om) 
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is yet another solution to the system. Notice, however, that this last solution is a non- 
trivial solution (since 6; (bz) 4 bz) with a greater number of zeros than (1, b2,...,bm), 
a contradiction. Hence, the Claim is proved. 

The Claim together with the assumption that E” = F implies that the b;’s are, in 
fact, in F. Now observe, in particular, that (b1, b2,...,bm) satisfying the first equation 
of the system (recall that ¢, is the identity homomorphism on E)) yields the equation 
bya, + boag + +++ + bmam = 0. This is a non-trivial F-linear combination of the basis 
G1, 42,---,Am of E over F' equaling zero, a glaring contradiction. 


EXERCISES 


1 Verify in the proof of Theorem 10.7 that since E’ is separable over F it is 
separable over the intermediate field K as well. 


2 Consider Exercise 1 in Section 10.6. 


a. Determine a subgroup H < Gal(E/F) for which E” = F. 


b. Confirm the conclusion of Artin’s Lemma 10.4. 


10.8 THE FUNDAMENTAL THEOREM OF GALOIS THEORY 


We have already seen a link between groups and fields in regards to the Galois group 
Gal(E/F’) and the degree [FE : F’]. However, the link goes further and is more profound 
— fundamental even. 


Theorem 10.8 (The Fundamental Theorem of Galois Theory) Let F C E 
be fields with [E: F| < co and E Galois over F’. The following are true: 


1. There is a one-to-one inclusion reversing correspondence between intermediate 
subfields of E over F and subgroups of Gal(E/F) given by the map K 
Gal(E/K) and its inverse H+ E? in the sense that 

(a) IfF C Ky C Ko CE, then Gal(E/K2) < Gal(E/k4) 
(b) If Hy < Ho < Gal(E/F), then FC EE? CE™ CE, 


2. If K is an intermediate subfield of E over F, then 
[E : K] = |Gal(E/K)| and [K : F] = [Gal(E/F) : Gal(E/K)}. 
3. If H < Gal(E/F), then 
[B: B*) =|A| and [E” : F] =[Gal(E/F) : H]. 


4. For K an intermediate subfield of E over F, we have that K is normal over F 
iff Gal(E/K) < Gal(E/F) and in this case 


Gal(K/F) © Gal(E/F)/Gal(E/K). 
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5. For H < Gal(E/F), we have HAGal(E/F) iff E™ is normal over F and in 
this case 


Gal(E” /F) = Gal(E/F)/Gal( E/E"). 


Before we give the proof of this result, let’s first illustrate the theorem with an 
example. 


Example 10.13 Set E = Q(wW2,i) and F = Q. First note that any element of 
Gal(E/F) is completely determined by where it sends the generators W2 and i. Sec- 
ond, note that any element of Gal(E/F) must send W2 to another zero of x* — 2, 
namely 4W2 and tiwW2. Third, note that any element of Gal(E/F) must send i 
to another zero of x? +1, namely +i. Therefore, there is an element of Gal(E/F) 
corresponding to each of the following permutations: 


V2 —W2 iV2 -iV2 i -i V2 -W2 iV2 -iW2 i —i 
V2 —-W2 iV2 -iV2 i -i —V2 v2 -iV2 iV2 i —-i 


V2 -VY2 iwW2 -iW2 i -i V2 -W2 iW2 -iW2 i -i 
Vo 22 age a2 =e. G Slo - WD 7 fe a> 


UO AID. GY A GD gg V2 —-W2 iw2 -iV/2 i -i 
I/D AID a9 V2. 4 4 12. S192. AD SD S44 


V9. =Y2 72 92 4 ai V2 =F 492 sa/2° 5 =a 
SEO Gx Oe JO: - SAD. ae iy Oe ev ee 0 a rn 


So at this point we know that |Gal(E/F)| > 8. Now we compute the value of 
[E : F]. Consider the following chain of fields: F C Q(W2) C E. Since «+ — 2 is 
irreducible over Q (Eisenstein) and has W2 as a zero, we get that [Q(W2) : F] = 4. 
Since x? +1 is irreducible over Q(W2) (else i € Q(W2) C R, a contradiction) and 
has i as a zero, we get that [E : Q(W2)] = 2 and so [E : F] = (4)(2) = 8. Now since 
|Gal(E/F)| < [EB : F] =8 and by the above work we get that |Gal(E/F)|=8=|E: 
F|. Hence, we know all the elements of Gal(E/F) and we also know that E is Galois 
over F which puts us in the context of the Fundamental Theorem of Galois Theory. 

Let’s now decide which group of order eight the Galois group is. There are five 
groups of order eight (up to isomorphism): there are the abelian ones Zg, Z4 ® Zo, 
Z2 © Zo ® Ze and there are the non-abelian dihedral group and the quaternions. 
Consider the eight permutations listed earlier. The fifth such, call it bd, has order four 
and the third such, call it W, has order two. One can check (exercise) that ow 4 wd 
and so the Galois group is non-abelian. In fact, one can check that @ and w generate 
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Gal(E/F) 
— ae 
a ss 
7 oe 
{ 1,47, uv, du } { 1,4,¢7,¢* } { 1, 67, dw, d*v } 
ae “fo — ~ 
{1,0} {1,2 } {1,67} { 1, dv } { 1,430 } 
ie ae ee ; a _ a 
it ele 
{1} 


E 
ee a 

— wr oe Oe cis 

Q72 Qi v2) Q( v2, i) Q.V2+iv2) = Q(W2—- iV?) 
™ — —~ a 
at il Rt a 
= | | | 2 
Q( v2) Q(i) tiv?) 
“_ Pa 
F 


Figure 10.2 The lattice of intermediate fields for F C E, where E = Q(W2,i) and 
F=Q. 


the Galois group and satisfy the relation Wd = 6° which is precisely the presentation 
for the dihedral group. In Figure 10.1, we have the lattice of subgroups for Gal(E/F’). 
A basis for E over F is 1, W2, V2, (W2)°, i, iv/2, iv/2, i(W2)> so that every 


element of E can be expressed as 
cr + co72 + c3V2 + c4(72)3 + c5i + cot 2 + criV’2 + cgi( 2)3, 


where C1, C2, C3, C4, C5, C6, C7, C3 € F. Using this representation one can proceed to com- 
pute the invariant subfields of E corresponding to each subgroup in the lattice of 
Gal(E/F) presented above. Such work would yield the lattice of intermediate sub- 
fields of E over F presented in Figure 10.2. 

Notice the inverted symmetry of the two lattices as predicted in the Fundamental 
Theorem of Galois Theory. 


We now give the proof of the Fundamental Theorem of Galois Theory. 
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Proof 10.14 To show the first statement, it is enough to show the composition of 
the two maps K +> Gal(E/K) and H > E®™ (and vice-versa) is the identity. First, 
for K an intermediate field of E over F, to show that EC™/*®) = K it is enough 
to show that E is Galois over K. To prove this, by Theorem 10.7, we show that 
E is both normal and separable over K. To see that E is normal over K, consider 
a K-homomorphism @: E + E. Since ¢ is also an F-homomorphism and E is 
normal over F’, we have that ¢(E) C E and this shows that E is normal over K. 
Just as in the proof of Theorem 10.7, E being separable over F' implies E is separable 
over any intermediate field K. Second, we show that Gal(E/E") = H. Certainly, 
Gal(E/E") D> H (by the definition of Gal(E/E")). For the reverse inclusion, by 
Artin’s Lemma, [E : E#| < |H| and by Corollary 10.2, |Gal(E/E")| < [E: BE"). 
Hence, |Gal(E/E")| <|H| and so Gal(E/E") = H. 

To prove second statement, as we saw in the first statement, E is both normal 
and separable over K so that by Corollary 10.3 it follows that |Gal(E/K)| = |[E: K]. 
To show that [Kk : F] = |Gal(E/F) : Gal(E/K)], using Lagrange’s Theorem, 


[K : F] =|E: F\/[E: K] =|Gal(E/F)|/|Gal(E/K)| = [Gal(E/F) : Gal(E/K)). 


The third statement follows immediately from second statement and the one-to- 
one correspondence established in first statement. 

To prove the fourth statement, first assume that some intermediate field K is 
normal over F. Take ¢ € Gal(E/F) and w € Gal(E/K). By assumption, for any 
a € K we have that ¢(a) € K and so W(¢(a)) = ¢(a). But then d-wWd(a) = a 
which implies that 6-!W¢d € Gal(E/K) and so Gal(E/K) < Gal(E/F). For the 
opposite direction we prove the contrapositive statement. Suppose some intermediate 
field K is not normal over F. Then for some ¢ € Gal(E/F) and some a € K 
we have that d(a) ¢ K. By the first statement, K = ECUE/K) | so there exists a 
w € Gal(E/K) such that w(¢(a)) 4 O(a) which implies that 6~'Wd(a) 4 a and so 
@ ‘wo ¢ Gal(E/K). Therefore, it is not the case that Gal(E/K) < Gal(E/F). 

To establish the isomorphism, consider the map V : Gal(E/F) > Gal(K/F) by 
U(¢) = o|K, the restriction of @ to K. Since K is normal over F this function ® 
maps into Gal(K/F), because ¢|K will map back into K. It is easy to check (exercise) 
that ® is an epimorphism with kernel equaling Gal(E/K), hence by the Fundamental 
Theorem of Homomorphisms the result follows. 

The fifth statement follows immediately from the fourth statement and the one- 
to-one correspondence established in first statement. 


Example 10.14 Here are some additional examples illustrating the Fundamental 
Theorem of Galois Theory. 


1. We have already investigated the case when E = Q(V2, V3) and F = Q in 
Example 10.10.2. We found that E was Galois over F. We computed the Galois 
group of E over F to be the Klein-4 group. The intermediate subfields of E over 
F are Q(V2), Q(V3) and Q(V6) and these were each found to be invariant 
subfields of E via the three subgroups of order 2 in Gal(E/F'). When you write 
out the lattice of subgroups and the lattice of intermediate subfields you obtain 
identically shaped lattices. 
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2. We have also looked at E = Q(W2) over F = Q. The Galois group of E over 
F was found to be Zz and so its lattice is of the simplest type having no proper 
non-trivial subgroups. However, the lattice of intermediate subfields of EF’ over 
F has a strictly intermediary subfield, namely Q(V/2), so the two lattices do not 


have the inverted symmetry. This is because E is not normal over F’. Note that 
Q(/2) = E® where G = Gal(E/F). 


3. Recall the example of F = Z,(t) where t is transcendental over F and E = F(a) 
where a is a root of x? -a=(x—a)?. Setp=2, L= F(a,i) and consider the 
Galois group of L over F. Any F-homomorphism in G = Gal(L/F) must fix 
a and send i to +t. Therefore, G = Zo and its lattice is again of the simplest 
type. However, the lattice of intermediate subfields of L over F includes three 
strictly intermediate subfields: E = F(a), F(i) and F(ai) and once again we 
do not have the inverted symmetry of the two lattices. This is because E is 
not separable over F'. Notice that of the three subfields only E is an invariant 
subfield, since E = L@. Indeed, since 


L={egotcja+ cit cai : G € FH, 
and the only non-trivial map in G is the one sending i to —i, then 


Lo ={a+aqa : Ge F}=F(a)=E. 


EXERCISES 


1 Write out the lattice of subgroups and the lattice of intermediate subfields for 
Exercise 1 in Section 10.6, 


2 In Example 10.13, verify that ow 4 Wo. 


3 In Theorem 10.8.4, verify that ® is an epimorphism with kernel equaling 
Gal(E/K). 


4 Write out the lattice of subgroups and the lattice of intermediate subfields for 
Example 10.14.1. 


10.9 SOLVABLE GROUP ESSENTIALS 


We spent a whole chapter on solvable groups, but for the reader who prefers to cover 
only as much of the topic as needed to see the proof that there is no quintic formula, 
we present this material here in a self-contained manner. For those who have covered 
the chapter on solvable and nilpotent groups, you may wish to simply skim this 
section since most of it you have already seen. 


Definition 10.12 A group G is called solvable if it has a series of subgroups 1 = 
Go AG, d--+- dG, =G with each Gi41/G; abelian. 


Example 10.15 Here, we list several examples of solvable groups. 
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1. Every abelian group G is solvable with series 1G. 
2. S3 is solvable with series 1d A3 <1 S3, since As = Z3 and S3 = Zo. 
3. S4 ts solvable with series 1<1.N < Aq <1 S4 where 
N = {1, (1 2)(8 4), (1 3)(2 4), (1 4)(2 3)}. 
Note that |N| = 4 and all groups of order 4 are abelian. 
Our primary goal in this section is to show that S,, is not solvable for n > 5. 


Proposition 10.1 Jn the case that G is a finite group, G is solvable iff each Gi41/Gi 
in the definition of solvable has prime order. 


Proof 10.15 One direction is immediate, since a group of prime order is isomorphic 
to Z, which is abelian. For the other direction, suppose that G is solvable. For each 
factor group Gi41/G; which does not have prime order we refine the series as fol- 
lows: Suppose that p is a prime dividing the order of Gi41/G;. By Cauchy’s Lemma, 
Giai1/G; has an element of order p, say gG;. Since Gi41/G; ts abelian, the subgroup 
(gG;) of order p is normal in Gi41/G;. Now, by the Correspondence Theorem, there 
exists a subgroup H 1 G;41 containing G; such that (gG;) = H/G;. Note that G;<H, 
since G; 1 Gi41. Repeat this process on the smaller abelian factor group Gij44/H. In 
this way and in a finite number of steps we can replace G; <1 Gi41 in the series by a 
subseries where each factor group is of prime order (note that we are implicitly using 
the Third Isomorphism Theorem for groups). 


Definition 10.13 Let X be a non-empty subset of a group G. The subgroup gen- 
erated by X, written (X) is the collection of all finite products of elements of X 
and their inverses. The set X is called the generating set of (X). 


If X = {91,92,---;9n} a finite set, then we write (91, 92,---,;9n) for (X). Note 
that if X = {g}, then (X) is simply the cyclic subgroup generated by g. One needs 
to check, of course, that (X) is indeed a subgroup of G. Furthermore, one can show 
that (X) is the smallest subgroup of G containing the set X. 


Example 10.16 Here are some examples of groups and their generating sets. 
1. The Klein-4 group V = {e, a,b,c} is generated by X = {a,b}. 
2. The quaternions are generated by the set X = {i, 7} or {i,k} or {j,k}. 


3. The dihedral group D4 (rotations and reflections of a square) is generated by 
any single (non-trivial) rotation and any single reflection. 


Definition 10.14 Let G be a group. 
1. For g,h€ G, the commutator of g and h, written 


(9, h] = gh on: 
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2. Let X and Y be two non-empty subsets of G. The commutator subgroup of 
X and Y, written 


[Xe SC ay) or eX, Be Ys 
the subgroup generated by the commutators [x, y}. 


3. For a group G, the derived subgroup of G (or sometimes called the com- 
mutator subgroup of G), written G' = [G,G]. 


Remark 10.5 Since for all g,h € G we have [g,h|~! = [h, g], it follows that G" is, 
in fact, the collection of all finite products of commutators in G (no need for their 
inverses). Another easy fact to verify is that G’ is normal in G. We prove some 
additional properties of G' in the theorem which follows. 


Theorem 10.9 Let G and K be groups and H be a subgroup of G. 
1. If6:G—>K a homomorphism, then ¢(G’) < Kk’ 
2.G'<H iff HAG and G/H is abelian. 


3. G' is the smallest normal subgroup of G which will form an abelian factor group. 


Proof 10.16 For the first statement, take any [g1, hi]-+-[9n, hn] € G’. By properties 
of a homomorphism, 


P([91, ha] +++ [Gns nl) = O(L91, Ral) +++ (Gn Pn]) = [d(91), O(h1)] ++ [(Gn), O(hn)] € A". 


For the second statement, first assume that G’ < H. Then for all g € G and 
h € H, we have g-thg = h(h-'g-thg) = h{h,g| € H and so HAG. To see that G/H 
is abelian, notice that for g1,g92 € G, 


HHH = g.HoHg, Hoy Ho HH = gH gH |g1, g2|H = 92H gH. 
Now assume that H AG and G/H is abelian. Then for all gi, g2 € G, we have 
g, Hoy Ho HH = gp Ho Hoy! Ho2H = H, 


and so |g1,92)H = H. Hence, [g1, 92] € H and so by Remark 10.5, it follows that 
GH: 

The third statement is simply a summary of the first two statements and requires 
no additional proof. 


Definition 10.15 For any natural number n, the nth derived subgroup, written 
G™), is defined recursively as follows: G2 = G, GY =G' and Ge) = (GM) 
forn>1. 

The series, G = GO > GY > G@) >... is called the derived series of G. 
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The following facts about derived series will be left as exercises: 


1. If the derived series of G terminates at 1 at some point, i.e. there exists a 
natural number n such that G”) = 1, then G is solvable. 


2. If H<G, then H™ < G™ for any natural number n. 


Proposition 10.2 G is solvable iff there exists a natural number n such that G™) = 
1. 


Proof 10.17 One direction was given as an exercise above. For the other direction 
assume that G is solvable. Then G has a series 1 = Gop d G1 d--- dG, =G with 
each Gi41/G; abelian. We show by induction oni that COG G5 fori =0,1,...,n. 
For i = 0, certainly G = GO Gy. = G. For 0 << nz Since Gye] Gu 48 
abelian, by Theorem 10.9, (Gn—i41)' C Gn_i. Now, by induction, GO-Y) C Greg 
Therefore, by the exercise above 


GUNG ey Ghana) Gre, 
In particular, G@™ C Gyn = Go = 1 and the result follows. 


Theorem 10.10 Jf G is a solvable group, then 
1. For each H < G, we have that H is also solvable. 


2. For each NG, we have that G/N is also solvable. 


Proof 10.18 Since G is solvable it has an abelian series, i.e. there is a subnormal 
series 1= Go < G1 <-++- < G, =G with each Gi41/G; abelian. To prove the first 
statement, consider the series 


1=ANG)< ANG, <::-<HOAG, =H. 


One can easily verify that the series is subnormal and by the Second Isomorphism 
Theorem, each 


AN Giai/HO G; = (AA Gi41)Gi/G; < Gini /Gi- 


In other words, HN Gij41/HOAG is isomorphic to a subgroup of an abelian group 
and is therefore abelian as well. Hence, 1 = HNGo< HANG, <:-:--< HAG, =H 
is an abelian series and so H is solvable. 

To prove the second statement, consider the series 


{N} = Go < GiN/N <---<GaN/N =G/N. 


By the Third Isomorphism Theorem (part 1), each G;N/N < Gi4i1N/N, since 
GN and N are both normal in Gi41N (check). By the Third Isomorphism Theorem 
(part 2), 

(Git N/N)/(GiN/N) = Giti N/GiN = Gi41/Gi, 
which we know to be abelian. Hence, {N} = Go < GiN/N <---<G,N/N = G/N 
is an abelian series for G/N and so G/N is solvable. 
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In preparation for showing that S, is not solvable for n > 5 we need the following 
lemma: 


Lemma 10.5 /fn > 3 ando © Ay, then o can be expressed as a product of 3-cycles. 


Proof 10.19 It’s enough to show any pair of transpositions, Tr’, can be written as 
a product of 3-cycles. There are three cases to consider. If tr' = (a b)(a b), then 
tT’ =(abc\(abc\(a bc), where c is different from a and b (note that n > 3). If 
tr =(e O)(a.c), thenar =(a ob). arr =a bed); then zr =(e bob ed), 


Corollary 10.4 [fn > 5, then S, is not solvable. 


Proof 10.20 First note that Ai, = An. Indeed, by Lemma 10.5, it is enough to show 
any 3-cycle is a commutator. Take any 3-cycle (a bc) andd and e distinct from a, b,c 
(note that n > 5). Then 

(abe) =(abalac fad da fd=((abd),(ac fy 


Second note that S!, = Ay. Indeed, S\, C An, since An dS, and S,/A, = Zo 
abelian (see Theorem 10.9). Then Ay, = Al, C S!, yields the equality. But then for 
any k, 5® = An #1 and so by Proposition 10.2, Sy, is not solvable. 


EXERCISES 


1 Check that the subgroup N in Example 10.15.3 is the Klein-4 group. 
2 Verify that the factor group Gj4,/H in the proof of Theorem 10.1 is abelian. 
3 Prove that for a group G we have G’ dG. 


4 If the derived series of G terminates at 1 at some point i.e. there exists a natural 
number n such that G(” = 1, then G is solvable. 


5 If H <G, then H”™ < G™ for any natural number n. 


10.10 SOLVABILITY BY RADICALS 


Any quadratic polynomial ax?+bx+c has roots which are solvable by radicals (formal 
definition to come) via the quadratic formula 


—bt VJ? — 4ac 


2a 


In the 1500’s italian mathematicians demonstrated that first any cubic polynomial 
and later any quartic polynomial is solvable by radicals. The quartic is rather messy 
and long to present here, but we will now show the solution for the cubic. Consider any 
cubic polynomial ax? +bxr?+cx+d. Without loss of generality we will find the roots of 
the monic cubic 7? +ba?+cx+d (simply multiply both sides of az? +bx?+cr+d=0 
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by a+). Again, without loss of generality, we consider cubic polynomials of the form 
x>+cx+d (replace x by x— 8 in the monic cubic above). Then the roots of 2?+cx+d 
can be shown to be of the form 


JWH+Y@), (WatC¥a, Ciati, 


1 1 1 
ee ae a 8V3Ai), w= 5 (27d — 3V3Ai), 


and 
A= V—4c3 — 27d3. 


Many years passed with people attempting to find a formula for the quintic. In 
the 1800’s it was suggested by LaGrange that perhaps there were no such formula, 
but it was Abel who at the age of 19 proved once and for all that there was no general 
formula for the quintic and thus put to rest the search for a general formula. Later 
in the century Galois gave a criterion for when a polynomial will have a solution 
by radicals. It was then shown that polynomials of a degree n > 5 have no general 
formula for solution by radicals. 

Our first goal is to obtain this criterion set out by Galois, which is aptly called 
the Galois Criterion for Solution by Radicals. 

We now present the formal definition of what it means for a polynomial to be 
solvable by radicals. 


Definition 10.16 Let F' be a field. A simple extension F(a) is called a radical 
extension if a” € F. The element a is called an nth root of a”. 


Note that a radical extension is an algebraic one, since the nth root a is a root of 
pee Fig. 


Example 10.17 Here, we give some examples of radical extensions. 
1. Q(V2) is a radical (quadratic) extension of Q, since (V2)? =2€Q. 


2. SetC=-F+ ¥3j, Then Q(¢) is a radical (cubic) extension of Q, since C = 
1EQ. 


Definition 10.17 Let F be a field. A polynomial f(x) € F [a] is solvable by rad- 
icals if there exists a chain of radical extensions F = Fy C Fi C -:: C Fy, (ie. 
Fyi1 = Fi(a;) where a7? € F; fori =0,1,...,2 —1) such that all the roots of f(z) 
lie in F,,. The chain of subfields is then called a root tower for f(x). 


Remark 10.6 A chain of fields F = Pp C Fy C--- C Fy such that F, is a root 
tower over F' is equivalent to the property that aj! € F and a;" € F(aj,a2,...,@:-1) 
for positive integers i = 2,...,n. In particular, Fy, = F(a1, d2,..., Qn). 
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Example 10.18 Here, we give some examples of root towers. 


1. Consider the polynomial f(x) = x? + 4x — 1 € Q{z] whose roots are —2 + V3. 
Then f(x) is solvable by radicals via the root tower Q = Fy C Fy = Q(V3). 


2. In general, any quadratic polynomial f(x) = ax? + br +c € Q{a] is solvable by 
radicals via the root tower Q = Fo C Q( Vb? — 4ac). 


3. Consider the polynomial f(x) = (x? + 4x — 1)(x* + «+ 1) which has roots 
24+ 3, —4 + V3 i, Then f(x) is solvable by radicals via the root tower 


Q=h& CH =Qv3) C Q(V3, 4). 


Note that root towers are not unique either in length or in intermediate fields, 
for a given polynomial solvable by radicals. For instance, in this example, we 


could have made F, = Q(t). 


4. Consider the cubic polynomial f(x) = 23 +cx+d € Q{z]. Using the cubic 
formula, we see that f(x) is solvable by radicals via the root tower 


QC Q(V3) C Q(V3, 4) C Q(V3, 7, A) C Q(V3, i, A, Ya) 
C Q(V3,1,A, 7x, 72). 


Lemma 10.6 Let E be the splitting field of f(x) = x" —1 € Fla] where charF = 0. 
Then Gal(E/F) is abelian. 


Proof 10.21 Since f’(z) = nx”! 4 0 and gcd(f, f’) = 1 we know that f(x) has 
distinct roots. Set H equal to the roots of f(x). Note that H < E*. Indeed, if a,b € H, 
then a” = 1 and b” = 1 and so (ab“!)" = a®™(b")-! = 1 which implies ab“! € H. 
Since any finite subgroup of the multiplicative group of a field is cyclic it follows 
that H is cyclic. Enumerate the subgroup H = {hy,ho,...,hn} and consider the 
map UV : Gal(E/F) — Aut(H) where U(¢)(hi) = O(hi). First note that V maps 
into Aut(H), since @ permutes the roots of H and is a field homomorphism. It’s 
easy to check that V is a homomorphism with trivial kernel (since ¢ is completely 
determined by where it sends the roots of f(x)). Now Aut(H) = Aut(Z,) = U(Zy) 
which is abelian. Hence, Gal(E'/F’) embeds in an abelian group and therefore must be 
abelian. 


Definition 10.18 The roots of <" —1 € F[x| with charF = 0 are called the nth 
roots of unity. Any generator of the cyclic group of nth roots of unity is called a 
primitive nth root of unity. 


Lemma 10.7 Let F be a field of characteristic 0 containing all the nth roots of unity. 
Let c € F and set f(x) = 2" —c€ Fs] and let E be the splitting field of f(x). Then 
Gal(E/F) is cyclic of order a divisor of n. 
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Proof 10.22 Fiz a € E a root of f(x) and ¢ € F a primitive nth root of unity. 
Observe that the n roots of f(x) are of the form C'a fori =0,1,...,n—1, since 


f(Cia) = (Ca)" —¢ = (")'a" —c = 1-¢- 0 =0. 


Therefore, E = F(a, a, C2a,...,¢"- 1a) = F(a), because F contains all the nth 
roots of unity. Hence, any element of Gal(E/F’) is completely determined by where 
it sends a and we know it must send a to another root of f(x), i.e. d(a) = Ca for 
some i € {0,1,...,n—1}. Therefore, we can define the map V : Gal(E/F) > Z,, 
by U(¢) = 7 where ¢(a) = Ca. It’s easy to check that V is a monomorphism and 
so Gal(E/F’) embeds in a cyclic group of order n and so must be cyclic of order a 
divisor of n. 


Lemma 10.8 Let p be a prime and F a field containing all the pth roots of unity. 
If E is a field containing F such that |E : F| = |Gal(E/F)| =p, then E is a radical 
extension of F where E = F(u) and u? € F. 


Proof 10.23 Set G = Gal(E/F) and fix ana € E—F. Since [E : F| is prime it 
must be that E = F(a). Set H = {G1,¢2,...,G)} the pth roots of unity in F. Since 
Gal(E/F) has prime order, it must be cyclic. Set Gal(E/F’) = (@). Define 


a1=a, a2=¢(a1), a3 = (a2), ..., @ = P(ap-1). 
Set uj = a, + Gag +--+ + (ie oe Notice that 
(us) = b(a1) + Gb(ag) ++: + CP b(ay) = ag + Gag tes + CP ay t+ Play 


= Cla t+ Gag +--+ Play) = Goh. 


Therefore, (ur) = ¢; Pu = uP. Hence, ub € ES = F, since E is Galois over F. 
Claim 10.3 E = F(u;) for some i € {1,2,...,p} (which proves the lemma) 


Let’s express in matrix form the identities we have derived thus far concerning 
the u;’s: 


U1 G1 1 1 ay 
1 

TD) TG: “Gs 4 ag 

Up 1 G GS mee ae Ap 


The coefficient matrix is called the Vandermonde matrix and one can show it 
is invertible by computing the determinant to be Ties (4 —G) £0, since the ¢’s are 
distinct and charF = p. So the linear system has a solution in ay, d2,...,Qp) in terms 
of U1, U2,...,Up. In particular, a = a, € F(uj,u2,...,Up). Thus, not all the u; are in 
F for otherwise a € F, a contradiction. Let i be such that u, ¢ F. Then E = F(u) 
(again, since |E : F] is prime). 
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Lemma 10.9 Jf K has a root tower over F of some polynomial in F’, then there 
exists a field EF > K such that E is normal over F and E has a root tower over F of 
some polynomial over F. 


Proof 10.24 By Assumption, K = F(ay,...,@n) where aj’ € F and a" € 
F(a, @2,...,@;-1) fori = 2,3,...,n. Let f(x) be the product of all the minimal 
polynomials of the a; over F and set E to be the splitting field of f(x). Since E is 
a splitting field of a polynomial over F' we know that E is normal over F’. Since the 
elements of Gal(E/F) are determined by where they send the roots of f(x), each root 
of f(x) has the form $(a;) for somei € {1,2,...,n} and some ¢ € Gal(E/F). Notice 
also that for a given @ € Gal(E/F) we have ¢(a1)™ = ¢(a}') = aj! € F and for 
i = 2,3,...,n we have 


o(ai)”’ = d(a;*) € (F(a, a2,...,ai-1)) = F(¢(a1), O(a2),...,(ai-1)), 


since the image of an element in F(a1,d2,.-.,@n) by @ ts completely deter- 
mined by where it sends a1,a2,...,a4n. Enumerate the elements in Gal(E/F) = 
{1, b2,.--, On}. Using the elements {¢;(ai) : 1<i<n, 1 <j < k} we can 
construct a root tower for E over F. 


We will need the Fundamental Theorem of Algebra for our final result which 
will illustrate the insolvability of the quintic. Our goal for the moment will be this 
important result. But first we need some additional results about separability. 


Lemma 10.10 /f FE is algebraic over F and char(F’) = 0, then E is separable over 
F. 


Proof 10.25 Take anya € E and set p(x) € F |x] to be the irreducible polynomial of 
a over F’. Since deg(p') < deg(p) and p(x) is irreducible, it follows that gcd(p, p’) = 1, 
and since char(F’) = 0 it must also be the case that p'(x) is not the zero polynomial. 
Hence, by an earlier result, p(x) has no multiple roots. 


Lemma 10.11 /f FE is finite and separable extension of F', then E is a simple ex- 
tension. 


Proof 10.26 We may assume that F is infinite, for it is always the case that a finite 
extension of a finite field is a simple extension. Since E is a finite and algebraic 
extension of F, we know that E is finitely generated over F. Set E = F(ay,...,@n) 
and the proof will be by induction on n, but to avoid a messy presentation we will 
simply prove the case of two generators and this proof easily generalizes. So set E = 
F(a,b), let p(x) be the minimal polynomial of a over F' of degree k and q(x) the 
minimal polynomial of b over F' of degree m. Let L be a field in which both p(x) and 
q(x) split. Since a and b are both separable over F the distinct roots of p(x) and q(x) 
are @ = @1,d2,...,a% and b = bi, be,...,bm (respectively). Choose c € F such that 
a; + be # at be for alli = 1,2,...,k and j = 2,3,...,m (such ac exists since 
a; + bjz = a+ bx has a unique solution, namely (a —a;)(b; — b)~', and F is infinite). 
Sett=a+cb. 
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Claim 10.4 E = Fit). 


Certainly, F(t) C E, since t € E. For the reverse inclusion, its enough to show 
that b € F(t), for then a = t — cb € F(t) and so F(a,b) C F(t). In order to show 
b € F(t), we show that the irreducible polynomial of b over F(t) has degree 1 from 
which it follows that |F(b,t) : F(t)] =1 and so F(b,t) = F(t) and be Fit). 

Let r(x) be the irreducible polynomial of b over F(t) and set f(x) = p(t — cx) € 
F(t)[z]. Since b is a root of both f(x) and q(x) it follows that r is a common divisor 
of f and gq in F(t)|z]. Now t— cb; 4 a; fori =1,2,...,k and j = 2,3,...,m by how 
c was defined and so b; is not a root of f(x) for j = 2,3,...,m. Hence, x — b; does 
not divide f(x) in F(#)[z] for 7 =2,3,...,m. Now q(x) = (a—b1)(a@— be) --- (4@—bm) 
in F(t)[x] and so gcd(f,q) =x —b, = r(x). 


We now prove the Fundamental Theorem of Algebra. We make use of four facts 
which we do not expressly prove here, but are easily attainable. 


1. Any polynomial f(a) € R[z] of odd degree has a real root. 


2. Every complex number has square roots which are also complex numbers. 
Hence, there are no irreducible quadratics over C and so there are no quadratic 
extensions of C. 


3. If f(x) € Cz], then f(x) f(x) € R, where f(x) is the polynomial obtained by 
replacing all the coefficients in f(x) by their complex conjugates. 


Theorem 10.11 (Fundamental Theorem of Algebra) C is algebraically closed. 


Proof 10.27 We use the Fundamental Theorem of Galois Theory so many times in 
this proof that for brevity we shall call it FTG. Let f(x) € Cla] and we show f(x) has a 
root in C. Let E be the splitting field of f(x) over C and set g(x) = (a7+1) f(x) f(x) € 
R[x]. Let L be the splitting field of g(x) over R. Note that C C E C L, since L 
contains R, 7%, and the roots of f(x). We will show that L = C and so E=C 
and the result is proved. Since char(R) = 0 we know by Lemma 10.10 that L is 
Galois over R. Set G = Gal(L/R), let H be a 2-Sylow of G (which may be trivial) 
and K = L". Certainly, [G : H] is odd and so by FTG, so is [K : R]. Since 
char(R) = 0, by Lemma 10.10 and Lemma 10.11, we know there is ana € K such 
that K = R(a). Therefore, |R(a) : R] is odd and so the irreducible polynomial of a 
over R is odd. However, we know every polynomial of odd degree over R has a real 
root, thus it must be the case that the minimal polynomial of a over R is linear. But 
then [Kk : R] = [R(a) : R] = 1 and so K = R which in turn implies G = H is a 
2-group. Set Gy, = Gal(L/C) which is a 2-subgroup of G. 


Claim 10.5 G, =1. 


Suppose not. Set |G,| = 2". By the First Sylow Theorem, there exists a subgroup 
of Gy of order 2*-! and since it has index 2 in Gy it must be normal in G,. Call it 


N <G, and set K = LN. By FTG, [K : C] = [G, : N] = 2. Since char(C) = 0, 
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by Lemma 10.10 and Lemma 10.11, K = C(a) for somea€ K. But this contradicts 
that C has no quadratic extension and thus the claim is proved. 

Having proved the claim we see now that Gal(L/C) = Gi = 1 and so [L: C] = 
|Gal(L/C)| =1, which implies that L = C, and so E=C. 


Theorem 10.12 (Galois Criterion for Solvability by Radicals) Let f(x) € 
F\a| and char(F’) = 0. Let E be the splitting field of f(x) over F. Then f(x) is 
solvable by radicals iff Gal(E/F) is a solvable group. 


Proof 10.28 First assume that f(x) is solvable by radicals. Therefore, there exists 
F=fFfoCF,C.---C F, =L a root tower for f(x) over F. Let E be the splitting 
field for f(x) over F. Since E is the smallest field containing F and the roots of f(x) 
we know that EF C L. By Lemma 10.9, we may assume that L is normal over F and so 
L is the splitting field of some polynomial g(x) € Fx] over F’. L being a root tower of 
f(x) over F we know that each Fi41 = Fj(a;) with a;" € F; for some positive integer 
n; (i=0,1,...,m-1). Set n equal to the least common multiple of all the n;’s and ¢ a 
primitive nth root of unity. Note that F(¢) = Fo(¢) C Fi(¢) C +--+ © Fin() = LC) 
is a chain of radical extensions as well as the splitting field of g(x)(a" — 1). Thus, 
L(¢) is normal over F(¢) and since char(F’) = 0, by Lemma 10.10, L(¢) is Galois 
over F(¢). Since F C E C L(C) and E is normal over F’, by FTG, we know that 


Gal(E/F) = Gal(L(¢)/F)/Gal(L(¢)/E). 
Thus, it’s enough to show Gal(L(¢)/F’) is a solvable group, by Theorem 10.10. 
Claim 10.6 Gal(L(¢)/F) is a solvable group. 


Seb he 1 FC Gis ics) Mind Ne = Gall CAG) for C= 12a nt, SG = 
Gal(L(¢)/F) and N = Gal(L(¢)/F(¢)). We will show that 1d Nm <---I Ni INAG 
is a series of subgroups of G such that the quotient group of adjacent subgroups is 
abelian and so G is solvable. Note that by how ¢ was defined, C/") is a primitive 
nth root of unity and so K; is the splitting field of «% — ¢°/™)a,; over Fi for 
i = 2,3,...,m. Hence, F; is normal over F;_1 and by FTG, N;-1 dN; fori = 
2,3,...,m. Furthermore, by Lemma 10.7, Ni-1/Ni = Gal(F;/Fi-1) is cyclic and 
therefore abelian. Finally, NAG and by Lemma 10.6, G/N = Gal(F(¢)/F) is abelian. 


Therefore, to prove the insolvability of the quintic, it is enough to produce a 
polynomial of degree five such that the Galois group of its splitting field over its field 
of coefficients is not solvable. We will do exactly this by constructing a polynomial 
whose Galois group is $5 which we know is not solvable. This construction can be 
generalized to any prime p > 5 and so this construction can be used to show the 
insolvability of polynomials of degree any prime > 5. For simplicity we will just show 
p = 5. We decided not to show the insolvability of polynomials of arbitrary degree > 5, 
since this involves more preliminaries regarding what are called symmetric functions. 

Consider the function g(a) = (x? + k)(a — 1)(x — m)(a — n) where k,l,m,n are 
even integers, k > 0 and 1 < m < n. Certainly g(x) has exactly three real roots, 
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Figure 10.3 The graph of a fifth degree polynomial. 


namely /,m,n and by consider sign changes for outputs of g(x), it is clear the graph 
has the general shape depicted in Figure 10.3. 

Therefore, g(x) has exactly one local maximizer c and one local minimizer d. Now 
the local maximum g(c) has value larger than 2, since at any odd integer r, we have 


g(r) 


Hence, the polynomial f(x) = g(x) — 2 has the same local minimizer and max- 
imizer with f(c) > 0 and so f(z) also has exactly three real roots. The reason we 
shifted from g(x) to f(a) is because (as we shall see) f(x) is irreducible, a property we 
will need. To see why f(z) is irreducible, first note that all but the leading coefficient 
of g(x) is divisible by 2 and its constant coefficient is divisible by 4. Therefore, the 
same holds true for f(x) except that now 4 does not divide its constant coefficient. 
Therefore, by Eisenstein’s Criterion using p = 2 it follows that f(a) is irreducible 
over Q. 


= |r? +k\|[r — |r — ml|r —n| > |r? +k) =r? +k > 2. 


Lemma 10.12 I[f a subgroup H of Ss contains a transposition and a 5-cycle, then 
H= Ss. 


Proof 10.29 By renumbering we may assume that (1 2) € H. For the 5-cycle o € 
H there exists a positive integers k,m,n such that (1 2k mn) € H. Again, by 
renumbering we may assume that (1 23 45) € H. But then (1 2)(1 23 45) = 
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(2345) € H. Therefore, 
(2345)?=(24)(35)€H and (2345)? =(2543)E8. 
(234 5)(1 2)(25 43) =(1 3) ER, 
(2 4)(3 5)(1 2)(2 4)(3 5) = (1 4) EH, and 
(26 43)\(1 22 345) = (15) eR. 


Therefore, for any transposition (mn) € S5, we have (mn) = (1 n)(1 m)(1 n) € 
H. Since any permutation is a product of transpositions, it follows that Ss = H. 


Lemma 10.13 Let f(x) = (a7 + k)(x — 1)(x — m)(x — n) — 2 where k,l,m,n are 
even integers, k > 0 andl <m<nand E the splitting field of f(x) over Q. Then 
Gal(E'/Q) = Ss. 


Proof 10.30 Since C is algebraically closed we know that E C C. Set G = Gal( E/Q) 
and since each element of G permutes the 5 roots of f(x) we know G < S5. Leta € E 
be one of the five roots of f(x) and set K = Q(a). Since f(x) is irreducible over Q 
we know [Kk : Q| = 5. Since char(Q) = 0 we know that E is Galois over Q and so 
by FTG, |G : Gal(E/K)| = [K : Q| = 5 which implies 5 divides |G|. By Cauchy’s 
Lemma, G has an element of order 5 which corresponds to a 5-cycle in Ss. Since f(x) 
has exactly 3 real roots and 2 complex conjugate roots, it follows that the map in G 
which sends one complex root to its conjugate has order two and thus corresponds to 
a transposition in Ss. Therefore, by Lemma 10.12, it follows that G = Ss. 


Corollary 10.5 Not every polynomial of degree 5 is solvable by radicals. 


Proof 10.31 Apply the Galois Criterion for Solvability by Radicals to the polynomial 
defined in Lemma 10.13 which has Galois group Ss which we know is not a solvable 


group. 


EXERCISES 


1 Inthe proof of Lemma 10.6, check that V is a homomorphism with trivial kernel. 
2 In the proof of Lemma 10.7, check that V is a monomorphism. 
3 Verify that the Vandermonde matrix is invertible. 


4 Prove the following three facts that were used in the Fundamental Theorem of 


Algebra: 


a. Any polynomial f(a) € R{«] of odd degree has a real root. 


b. Every complex number has square roots which are also complex numbers. 
Hence, there are no irreducible quadratics over C and so there are no 
quadratic extensions of C. 
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c. If f(x) € Cz], then f(x) f(x) € R, where f(z) is the polynomial obtained 
by replacing all the coefficients in f(x) by their complex conjugates. 


5 In the proof of Theorem 10.12, verify that all but the leading coefficient of g(x) 
is divisible by 2 and its constant coefficient is divisible by 4. 


6 In the proof of Lemma 10.12, explain why for the 5-cycle o € H there exists a 
positive integers k,m,n such that (1 2k mn) € H. 
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Gaussian integers, 193 
general linear group, 25, 92 
general linear group of a vector space, 92 
greatest common divisor, 18, 199 
group, 24 
abelian, 25 
action, 102 
fixator, 103 
orbit, 104 
stabilizer, 103 
transitive, 107 
alternating, 45 
automorphism, 56 
inner, 56 
automorphism group, 58 
cancellation property, 27 
center, 148 
conjugacy class, 104 
coset, 65 
cyclic, 37 
direct sum, 52 
embedding, 56 
endomorphism, 56 
epimorphism, 56 
exponentiation, 30 
external product, 50 
external semidirect product, 77 
factor, 71 
free, 136 
basis, 136 
rank, 138 
words, 138 
free abelian, 140 
Galois group, 248 
generated by, 37 
generator, 37 
homomorphism, 54 
canonical, 75 
kernel, 56 
natural, 75 


normal, 149 
inner automorphism group, 58 
internal direct sum, 52 
internal product, 52 
internal semidirect product, 76 
internal sum, 52 
isomorphic, 60 
isomorphism, 56 
latin square, 28 
metabelian, 158, 159 
monomorphism, 56 
multiplication table, 27 
nilpotency class, 159 
nilpotent, 159 
normalizer, 104 
order, 27 
p-group, 122 
p-subgroup, 122 
p-Sylow subgroup, 122 
number of, 125 
presentation, 140, 141 
quotient, 71 
representation, 144 
faithful, 144 
permutation, 146 
set product, 51 
simple, 82 
solvable, 158, 159, 271 
solvable class, 158, 159 


split, 76 
subgroup, 32 
center, 34 


centralizer, 34 
characteristic, 148 
commutator, 152, 273 
derived, 152, 273 
generated by, 34, 150, 272 
improper, 34 
index, 67 
normal, 70 
nth center, 155 
nth derived, 155 
trivial, 34 
symmetric group, 12 
trivial, 27 


hyperplane, 94 


integral domain, 170 
Euclidean domain, 196 
factorization, 203 
principal ideal domain, 196 
subdomain, 170 
unique factorization, 204 
unique factorization domain, 204 
irreducible, 200 


order 
element, 37 
group, 27 


partition, 7 
Polya’s Formula, 111 


polynomial 
content, 210 
degree, 188 


derivative, 241 
irreducible, 189 
irreducible of/over, 222 
leading coefficient, 188 
minimal of/over, 222 
monic, 188 
primitive, 210 
reducible, 189 
root, 191 
nth root, 276 
nth root of unity, 277 
primitive nth root of unity, 277 
separable over, 260 
simple zero, 260 
splits, 239 
zero, 191 
poset, 252 
chain, 253 
inductively ordered, 253 
maximal element, 253 
upper bound, 253 
power set, 3 
prime, 18, 200 
projective general linear group, 95 
projective special linear group, 95 
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relation, 3, 10 


anti-symmetric, 4 
congruence, 20 
equivalence relation, 4 
function, 4 

irreflexive, 4 

linear ordering, 252 
partial ordering, 4, 252 
reflexive, 4 

symmetric, 4 
transitive, 4 


ring, 165 


cartesian product ring, 166 
center, 168 
characteristic, 186 
endomorphism ring, 166 
epimorphism, 177 
factor ring, 178 
homomorphism, 176 
canonical, 181 
evaluation, 216 
kernel, 176 
isomorphism, 177 
matrix ring, 166 
monomorphism, 177 
nilpotent element, 189 
of polynomials in, 216 
polynomial ring, 166, 216 
quotient ring, 178 
subring, 167 
ideal, 178 
trivial ring, 167 
unit, 167 
zero ring, 167 


root tower, 276 


separable over, 260 
series, 154 


abelian, 154 
central, 154 
derived, 156, 273 
length, 154 

lower central, 156 
normal, 154 
subnormal, 154 
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term, 154 

upper central, 157 
skew field, 168 

quaternions, 168 
skewfield 

hamiltonion integers, 174 

quaternions, 172 
solvable by radicals, 276 
special linear group, 25, 92 
symmetric group on n, 12 

k-cycle, 42 

cycle type, 44 


disjoint cycles, 42 
transposition, 42 
even and odd, 45 


transcendental over, 217 
transvection, 94 


unipotent group, 25, 92 
unit, 22 


Vandermonde matrix, 278 


zero divisor, 22, 169 


